Remote Sr. Site Reliability Engineer (Reliability Enablement)

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • Xero is seeking a Senior Site Reliability Engineer (Reliability Enablement) to join their team in San Francisco, US, with a focus on delivering a great customer experience through understanding system behavior and operations.
  • The role involves post-incident analysis, advocating for learning from incidents, and engaging with teams across the organization through specialized reliability enablement and consulting.
  • Responsibilities include investigating operational surprises, conducting in-depth incident analysis, and maximizing post-incident learning.
  • The engineer may be embedded within an engineering portfolio or work within the central reliability enablement team, advocating for reliability and incident learning.
  • The position requires participation in the SRE On Call function, providing specialist incident commander capabilities for complex major and critical incidents.
  • The role includes improving on-call health, uplifting observability, addressing operational hotspots, and supporting the delivery of strategic features with reliability expertise.

Requirements:

  • Candidates must have solid experience in logging, monitoring, and observability of highly distributed systems.
  • Experience in leading incident management and response, including handling critical, complex, and high-severity incidents is required.
  • Candidates should have a background in conducting post-incident reviews and learning from incidents.
  • Experience working in a tech or product company with comparable scale and complexity is necessary.
  • A systems thinking approach, understanding how systems and components interact and respond to failure, is essential.
  • Proficiency in one or more object-oriented programming languages (C#, JavaScript, Java, Python, etc.) or experience with infrastructure-as-code (e.g., Terraform, CloudFormation) is required.
  • Preferred qualifications include experience with cloud providers (AWS, Azure, GCP), designing and operating distributed systems, and delivering technical initiatives in an operational or site reliability capacity.
  • Candidates should demonstrate the ability to solve engineering challenges outside their own team and have experience in reliability concepts like capacity management and fault tolerance.

Benefits:

  • Xero offers a competitive salary range of $170,000 - $195,000 per year.
  • The company promotes a human-first culture of respect, fairness, and inclusion, fostering diversity of thought.
  • Employees receive generous paid leave, including dedicated leave for physical and mental wellbeing, and access to an Employee Assistance Program for mental health care.
  • Benefits include medical, dental, vision, and disability insurance, as well as fertility and family forming financial support.
  • Xero provides 401k contribution matching, 26 weeks of paid parental leave for primary caregivers, and an Employee Share Plan.
  • Employees enjoy beautiful office spaces with snacks and break areas, flexible working arrangements, and opportunities for career development.
About the job
Posted on
Job type
Salary
$ 170,000 - 195,000 USD / year
Experience level
Leave a feedback