Welcome to RemoteYeah 2.0! Find out more about the new version here.

Remote Principal Data Engineer

at RYZ Labs

Posted 1 week ago | 0 applied

Description:

  • Ryz Labs is seeking a Principal Data Engineer to design, build, and own modern data platforms from end-to-end.
  • The role involves developing, testing, and maintaining robust batch and streaming data pipelines using Python, SQL, and Spark/Databricks.
  • Responsibilities include provisioning and evolving cloud resources (AWS, Azure, or GCP) using Infrastructure as Code (IaC) frameworks like Terraform or CloudFormation.
  • The engineer will enforce software-craft disciplines such as version control, automated testing, CI/CD, and code reviews while writing clean, idiomatic, and well-documented Python code.
  • The position requires producing clear architectural diagrams, architectural decision records (ADRs), and technical design documents to guide implementation and onboarding.
  • The engineer will monitor and optimize data pipelines, build dashboards, tune performance, and manage incident response and root-cause analysis.
  • A key responsibility is to champion data quality and governance by implementing validation, lineage, and observability to ensure trustworthy data assets.
  • The role involves collaborating cross-functionally with product, data science, and platform teams to translate business questions into scalable data solutions.

Requirements:

  • A minimum of 6 years of experience in data engineering with proven production use of Databricks, including tuning examples.
  • Candidates must hold certifications such as Databricks certified data engineers and cloud-specific data engineering certifications.
  • Mastery in Python, including object-oriented programming (OOP), type hints, packaging, and performance tuning is required.
  • Expertise in Spark, particularly in performance tuning and data engineering optimizations, is essential.
  • Candidates must possess expert-level SQL skills and experience in data model design for relational databases.
  • Proficiency in Infrastructure-as-Code tools (Terraform, CDK, or ARM) and CI/CD tooling (GitHub Actions, Jenkins, Azure DevOps) is necessary.
  • Knowledge of observability and monitoring tools (Datadog, Prometheus, OpenTelemetry) for data services is required.
  • Candidates should be well-versed in setting up data quality and data integrity pipelines and comfortable with writing design documents.

Benefits:

  • RYZ Labs offers a remote and distributed work environment, allowing for flexibility and autonomy in your role.
  • Employees will have opportunities for learning, growth, and engagement in challenging projects.
  • The company fosters a culture of collaboration, where team members can share and learn from each other.
  • RYZ Labs emphasizes a customer-first mentality, ownership, and a bias for action, creating a dynamic and impactful work atmosphere.
  • The organization values humility, respect, frugality, and continuous improvement, encouraging employees to raise their standards and deliver efficient results.