This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link
was detected as broken.
Description:
At Yugabyte, we are on a mission to become the default transactional database for enterprises building cloud-native applications.
YugabyteDB is our PostgreSQL-compatible distributed database designed for cloud-native apps, which is resilient, scalable, and flexible.
The role of Site Reliability Engineer focuses on database availability and reliability, operating and automating the life cycle of the YugabyteDB DBaaS.
Responsibilities include designing and building processes to manage databases using secure, reliable, scalable, and highly observable methodologies.
The engineer will work with Kubernetes environments (GKE, EKS, AKS), Java frameworks, Shell scripts, Python scripts, Terraform templates, and other cloud technologies.
Participation in the on-call rotation for 12 hours a day over 7 days, every 4-5 weeks, is required to manage incidents on the DBaaS infrastructure.
The role involves diagnosing problems with database and infrastructure technology to deliver reliable service to customers.
We seek strong engineers who exemplify collaboration, teamwork, and empathy, and thrive in a fast-paced startup environment.
Requirements:
Candidates must have strong software design and implementation skills in building infrastructure frameworks.
Experience in building and operating data systems for production applications, including fault-tolerant designs and automation of critical operations, is required.
A strong track record of Incident Response and Management in a managed service that is mission-critical for customers is essential.
Experience with relational database systems, preferably PostgreSQL, is necessary.
Familiarity with public cloud infrastructure (AWS, GCP, and/or Azure) is required.
Knowledge of containerization tooling, theory, and design (Docker, Kubernetes) is essential.
Experience with Infrastructure as Code, preferably Terraform, is required.
Familiarity with Configuration Management Tooling, preferably Ansible, is necessary.
Proficiency in automation scripting using Python and Bash is required.
Experience with monitoring systems, preferably Prometheus, is essential.
Familiarity with version control systems (git) and CI/CD systems (GitHub Actions) is required.
A solid understanding of Linux systems operations and troubleshooting is necessary.
Candidates must demonstrate a willingness and ability to learn new languages and concepts.
1-6 years of relevant experience is required.
Benefits:
Yugabyte is committed to a diverse workforce and provides equal employment opportunities.
Employment decisions regarding recruitment and selection are made without discrimination based on various characteristics.
The company values collaboration and teamwork, fostering a supportive work environment.
Employees have the opportunity to work on cutting-edge technology in a fast-paced startup environment.
The role offers the chance to contribute to a mission-critical product that is open source and widely adopted by leading enterprises.