Remote Site Reliability Engineer, Cloud

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • At Yugabyte, we are on a mission to become the default transactional database for enterprises building cloud-native applications.
  • YugabyteDB is our PostgreSQL-compatible distributed database designed for cloud-native apps, which is resilient, scalable, and flexible.
  • The role of Site Reliability Engineer focuses on database availability and reliability, operating and automating the life cycle of the YugabyteDB DBaaS.
  • Responsibilities include designing and building processes to manage databases using secure, reliable, scalable, and highly observable methodologies.
  • The engineer will work with Kubernetes environments (GKE, EKS, AKS), Java frameworks, Shell scripts, Python scripts, Terraform templates, and other cloud technologies.
  • Participation in the on-call rotation for 12 hours a day over 7 days, every 4-5 weeks, is required to manage incidents on the DBaaS infrastructure.
  • The role involves diagnosing problems with database and infrastructure technology to deliver reliable service to customers.
  • We seek strong engineers who exemplify collaboration, teamwork, and empathy, and thrive in a fast-paced startup environment.

Requirements:

  • Candidates must have strong software design and implementation skills in building infrastructure frameworks.
  • Experience in building and operating data systems for production applications, including fault-tolerant designs and automation of critical operations, is required.
  • A strong track record of Incident Response and Management in a managed service that is mission-critical for customers is essential.
  • Experience with relational database systems, preferably PostgreSQL, is necessary.
  • Familiarity with public cloud infrastructure (AWS, GCP, and/or Azure) is required.
  • Knowledge of containerization tooling, theory, and design (Docker, Kubernetes) is essential.
  • Experience with Infrastructure as Code, preferably Terraform, is required.
  • Familiarity with Configuration Management Tooling, preferably Ansible, is necessary.
  • Proficiency in automation scripting using Python and Bash is required.
  • Experience with monitoring systems, preferably Prometheus, is essential.
  • Familiarity with version control systems (git) and CI/CD systems (GitHub Actions) is required.
  • A solid understanding of Linux systems operations and troubleshooting is necessary.
  • Candidates must demonstrate a willingness and ability to learn new languages and concepts.
  • 1-6 years of relevant experience is required.

Benefits:

  • Yugabyte is committed to a diverse workforce and provides equal employment opportunities.
  • Employment decisions regarding recruitment and selection are made without discrimination based on various characteristics.
  • The company values collaboration and teamwork, fostering a supportive work environment.
  • Employees have the opportunity to work on cutting-edge technology in a fast-paced startup environment.
  • The role offers the chance to contribute to a mission-critical product that is open source and widely adopted by leading enterprises.
About the job
Leave a feedback