Arbor is seeking a proactive Site Reliability Engineer to join their SRE team and enhance platform resilience and performance.
The role involves advising on site reliability aspects such as availability, scalability, observability, and capacity planning.
Responsibilities include monitoring platform performance, collaborating with engineering teams to resolve performance issues, and improving observability through tools like DataDog or Prometheus.
The engineer will ensure high availability and resilience of services, champion best practices, and conduct capacity assessments for future scaling needs.
The position requires close collaboration with various teams to provide excellent service and embed SRE practices.
The engineer will also be involved in incident response, troubleshooting, and maintaining documentation and playbooks.
Requirements:
Candidates must have experience in performance monitoring and analysis.
Capacity planning experience is essential.
Proficiency in scripting and automation with relevant technologies is required.
Experience with Infrastructure as Code, particularly Terraform, is necessary.
A solid understanding of relational database technologies and their cloud versions, such as AWS Aurora, is needed.
Familiarity with messaging and distributed asynchronous workloads is important.
Experience with nginx or similar technologies is required.
Candidates should be familiar with SRE processes and aware of DevOps principles.
Benefits:
Arbor offers a chance to work with a passionate team where the impact of your work is visible daily.
Employees benefit from a dedicated wellbeing team and initiatives like mindfulness and mental health training.
The position includes 32 days of holiday plus Bank Holidays, with additional company-wide days off.
Life Assurance is provided at 3x the annual salary.
Comprehensive wellness benefits include a 24/7 virtual GP service and mental health support.
Private Dental Insurance with Bupa is included.
A salary sacrifice Pension plan is provided by Scottish Widows.
Enhanced maternity, adoption, and paternity leave policies are in place.
Employees have access to financial wellbeing coaching services.
Flexible working arrangements are supported.
There are opportunities for professional development and volunteering.
The workplace is dog-friendly, promoting a positive work environment.