Remote Lead Site Reliability Engineer

Please let HighLevel know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • HighLevel is seeking a Lead Site Reliability Engineer to ensure the availability, performance, and scalability of critical systems.
  • The role involves working closely with development and operations teams to automate processes, enhance system reliability, and improve observability.
  • Responsibilities include developing and improving observability using monitoring, logging, tracing, and alerting tools, optimizing system performance, troubleshooting incidents, and conducting post-mortems to prevent future issues.
  • The engineer will collaborate with developers to enhance application reliability, scalability, and performance, drive cost optimization efforts in cloud environments, and monitor multiple databases.
  • The position also requires providing technical leadership and mentorship to SRE team members, fostering a culture of continuous learning and knowledge sharing in site reliability practices.

Requirements:

  • Candidates must have 5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles.
  • Hands-on experience with cloud platforms such as GCP and AWS is required.
  • Proficiency in Infrastructure as Code (IaC) tools like Terraform, Helm, or equivalent is necessary.
  • Experience with containerization and orchestration tools, specifically Docker and Kubernetes (GKE), is essential.
  • Candidates should have experience with observability tools such as Prometheus, Grafana, ELK, OpenTelemetry, or similar monitoring/logging tools.
  • Proficiency in programming or scripting languages, particularly Python or Bash/shell scripting, is required, along with a basic understanding of parsing API responses and manipulating JSON.
  • Hands-on experience with CI/CD pipelines using tools like Jenkins, GitHub Actions, ArgoCD, or similar is necessary.
  • Experience in incident management, including on-call rotations, SLOs, SLIs, SLAs, escalation policies, and incident resolution, is required.
  • Candidates should have experience in monitoring databases such as MongoDB, Redis, Elasticsearch, and queue-based systems.

Benefits:

  • HighLevel promotes a strong company culture that fosters creativity, collaboration, and a healthy work-life balance for employees.
  • The company values diversity and is committed to inclusive hiring and promotion practices.
  • Employees are encouraged to be their true selves and are welcomed for their differences, contributing to a supportive work environment.
  • Reasonable accommodations may be made to enable individuals with disabilities to perform essential functions of the job.