Remote Sr. Site Reliability Engineer (Reliability Enablement)
Posted
This job is closed
This job post is closed and the position is probably filled. Please do not apply.
π€ Automatically closed by a robot after apply link
was detected as broken.
Description:
Xero is seeking a Senior Site Reliability Engineer (Reliability Enablement) to join their team in San Francisco, US, with a focus on delivering a great customer experience through understanding system behavior and operations.
The role involves post-incident analysis, advocating for learning from incidents, and engaging with teams across the organization through specialized reliability enablement and consulting.
Responsibilities include investigating operational surprises, conducting in-depth incident analysis, and maximizing post-incident learning.
The engineer may be embedded within an engineering portfolio or work within the central reliability enablement team, advocating for reliability and incident learning.
The position requires participation in the SRE On Call function, providing specialist incident commander capabilities for complex major and critical incidents.
The role includes improving on-call health, uplifting observability, addressing operational hotspots, and supporting the delivery of strategic features with reliability expertise.
Requirements:
Candidates must have solid experience in logging, monitoring, and observability of highly distributed systems.
Experience in leading incident management and response, including handling critical, complex, and high-severity incidents is required.
Candidates should have a background in conducting post-incident reviews and learning from incidents.
Experience working in a tech or product company with comparable scale and complexity is necessary.
A systems thinking approach, understanding how systems and components interact and respond to failure, is essential.
Proficiency in one or more object-oriented programming languages (C#, JavaScript, Java, Python, etc.) or experience with infrastructure-as-code (e.g., Terraform, CloudFormation) is required.
Preferred qualifications include experience with cloud providers (AWS, Azure, GCP), designing and operating distributed systems, and delivering technical initiatives in an operational or site reliability capacity.
Candidates should demonstrate the ability to solve engineering challenges outside their own team and have experience in reliability concepts like capacity management and fault tolerance.
Benefits:
Xero offers a competitive salary range of $170,000 - $195,000 per year.
The company promotes a human-first culture of respect, fairness, and inclusion, fostering diversity of thought.
Employees receive generous paid leave, including dedicated leave for physical and mental wellbeing, and access to an Employee Assistance Program for mental health care.
Benefits include medical, dental, vision, and disability insurance, as well as fertility and family forming financial support.
Xero provides 401k contribution matching, 26 weeks of paid parental leave for primary caregivers, and an Employee Share Plan.
Employees enjoy beautiful office spaces with snacks and break areas, flexible working arrangements, and opportunities for career development.