This remote position is for a Team Lead, Site Reliability Engineering, at Pythian and is based in the United Kingdom.
The role involves leading a skilled Site Reliability Engineering team in a fast-paced and technically complex environment.
Responsibilities include overseeing the design, deployment, and operation of large-scale distributed systems while ensuring service reliability, scalability, and performance.
The Team Lead will act as a primary escalation point for critical incidents, mentor engineers, and drive automation initiatives to improve operational efficiency.
The position requires both strong technical expertise and proven leadership, spanning hands-on contributions, coaching, resource planning, and delivery management.
The role offers the opportunity to shape next-generation infrastructure while working with cutting-edge technologies in cloud, AI/ML, and distributed systems.
Key accountabilities include leading and mentoring the team, managing ticket queues and SLA compliance, conducting performance reviews, and contributing to delivery projects.
Requirements:
A minimum of 3 years of proven experience leading technical teams in site reliability, DevOps, or related fields is required.
Strong expertise with Google Cloud and infrastructure-as-code tools, specifically Terraform, is necessary.
In-depth knowledge of microservices, containers (Kubernetes, Docker), and service mesh technologies is essential.
Hands-on experience with Linux systems administration, PKI, networking, and distributed systems is required.
Proficiency in automation using Go, Python, or Shell scripting is necessary.
Demonstrated experience with monitoring and observability tools such as Prometheus, Grafana, and Loki is required.
Strong problem-solving, leadership, and communication skills are essential, with the ability to thrive in a fast-paced environment.
A mindset focused on scalability, reliability, and automation, adhering to SRE principles, is expected.
Benefits:
The position offers a competitive total rewards package with performance incentives.
There is a substantial training allowance for certifications, professional development days, and continuous learning opportunities.
The role provides remote-first flexibility, with occasional travel to Brighton as needed.
A full home office setup is included, featuring a laptop (choice of OS) and an annual workspace personalization budget.
An annual wellness budget is provided for gym memberships, fitness, or wellness activities.
Generous paid vacation, sick leave, and one annual day off for volunteering are included.
The company promotes a collaborative culture with industry-leading peers and opportunities to work on cutting-edge technologies.