Please, let HighLevel know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
HighLevel is seeking a Lead Site Reliability Engineer to ensure the availability, performance, and scalability of critical systems.
The role involves working closely with development and operations teams to automate processes, enhance system reliability, and improve observability.
Responsibilities include developing and improving observability using monitoring, logging, tracing, and alerting tools, optimizing system performance, troubleshooting incidents, and conducting post-mortems to prevent future issues.
The engineer will collaborate with developers to enhance application reliability, scalability, and performance, drive cost optimization efforts in cloud environments, and monitor multiple databases.
The position also requires providing technical leadership and mentorship to SRE team members, fostering a culture of continuous learning and knowledge sharing in site reliability practices.
Requirements:
Candidates must have 5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
Hands-on experience with cloud platforms such as GCP and AWS is required.
Proficiency in Infrastructure as Code (IaC) tools like Terraform, Helm, or equivalent is necessary.
Experience with containerization and orchestration tools, specifically Docker and Kubernetes (GKE), is essential.
Candidates should have experience with observability tools such as Prometheus, Grafana, ELK, OpenTelemetry, or similar monitoring/logging tools.
Proficiency in programming or scripting languages, particularly Python, Bash, or Shell scripting, is required, along with a basic understanding of API parsing and JSON manipulation.
Hands-on experience with CI/CD pipelines using tools like Jenkins, GitHub Actions, ArgoCD, or similar is necessary.
Experience in incident management, including on-call rotations, SLOs, SLIs, SLAs, escalation policies, and incident resolution, is required.
Candidates should have experience in monitoring databases such as MongoDB, Redis, Elasticsearch, and queue-based systems.
Benefits:
HighLevel promotes a strong company culture that fosters creativity, collaboration, and a healthy work-life balance for employees.
The company values diversity and is committed to inclusive hiring and promotion practices.
Employees are encouraged to be their true selves and are welcomed for their differences, contributing to a supportive work environment.
Reasonable accommodations may be made to enable individuals with disabilities to perform essential functions of the job.
Apply now
Please, let HighLevel know you found this job
on RemoteYeah
.
This helps us grow 🌱.