Remote Senior Site Reliability Engineer - Data Warehouse (ClickHouse)
Posted
This job is closed
This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link
was detected as broken.
Description:
The Senior Site Reliability Engineer will be responsible for managing redundant and scalable ClickHouse clusters across multiple cloud providers.
They will educate and train team members on ClickHouse cluster design and management.
The role involves maintaining and enhancing essential IT infrastructure, ensuring system security, scalability, and reliability.
Automation of infrastructure processes using Infrastructure as Code (IaC) will be a key responsibility.
Monitoring performance, managing infrastructure incidents, and developing cost-effective strategies while maintaining high-quality standards are part of the role.
Requirements:
Proven expertise in managing large-scale ClickHouse deployments.
In-depth understanding of ClickHouse architecture, including configuration, scaling, and cluster management.
Experience with cloud services (AWS, GCP, Azure) and managing databases across multiple environments.
Strong proficiency in Terraform, Kubernetes, and additional IaC tools for efficient infrastructure management.
Expertise in Python for scripting and automation.
Solid understanding of SRE principles and quick problem-resolution skills.
Excellent verbal and written communication skills.
Leadership skills with minimal oversight are required.
Benefits:
Competitive and generous total compensation package including equity options and 401k option.
Comprehensive health, dental, and vision plans with company contribution.
Flexible vacation and paid time off policy.
Team events and off-sites for team building.
Budget for online courses, books, and conferences for continuous learning.
Employee wellness programs to support self-care and overall wellness.