Remote Lead Site Reliability Engineer

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • Architect and lead the delivery of high-quality and reliable solutions through creative problem-solving and technical expertise.
  • Enable Engineers on the team to improve the quality and impact of their work and delivery.
  • Evangelize reliability-as-a-feature through monitoring, service-level objectives, automation, everything-as-code, and testing.
  • Provide technical leadership and guidance to the SRE team, driving best practices in reliability engineering, automation, and service management.
  • Set the direction for SRE projects, aligning them with organizational goals, and ensuring successful execution from concept to delivery.
  • Define and instrument Service-Level Objectives to ensure the best customer experience.
  • Lead initiatives to improve system resilience and scalability.
  • Host postmortems to share learnings, discover gaps, embrace transparency, and improve reliability across services.
  • Lead projects from inception to completion.
  • Participate in an on-call rotation to assist in finding resolutions during incidents.

Requirements:

  • 7+ years of experience building infrastructure solutions in AWS using Infrastructure-as-Code technologies like Terraform or CloudFormation.
  • 7+ years of experience working with Docker containers and related orchestration technologies (such as Kubernetes or ECS).
  • 7+ years of experience building and deploying CI/CD pipelines.
  • Experience with AWS, Docker, Kubernetes, Terraform, Python, PHP, and Laravel.
  • Experience with architectural patterns of large, high-scale applications.
  • Experience leading wide-scale and complex projects and initiatives.
  • Experience working collaboratively in cross-functional teams.
  • Deep technical expertise in writing, debugging, and refactoring code.
  • Demonstrated mastery in automation, infrastructure, and developer tooling.
  • Experience leveraging observability tooling and practices such as SLOs.
  • Leadership skills to define and deliver large, complex projects and champion the SRE function throughout the organization.

Benefits:

  • Competitive salary and equity packages.
  • Company Performance Incentive Plan.
  • Comprehensive benefits including medical, dental, and vision insurance, flexible spending account, 401k, mental health & wellness programs.
  • $75 WFH stipend for remote employees.
  • Home office setup stipend for remote employees.
  • Minimum Time Off policy (unlimited PTO, with at least 3 weeks off) for exempt employees.
  • 11 company observed holidays.
  • Additional holidays: Curology days off (1 per quarter), 1 annual floating holiday, and Gratitude Week.
  • Paid parental leave.
  • Employee donation matching program.
  • Company-sponsored events.
  • Free subscription to Curology or Agency.
About the job
Posted on
Job type
Salary
$ 133,000 - 205,000 USD / year
Experience level
Leave a feedback