Please, let ZayZoon know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
ZayZoon is a fast-growing Financial Technology and HR company aiming to empower small and midsize businesses with financial tools.
The company has created a financial empowerment platform focused on employee financial wellness.
The role of Senior Site Reliability Engineer involves enhancing ZayZoon’s cloud infrastructure through complex AWS builds, infrastructure-as-code, and observability/logging/APM solutions.
The engineer will work in an embedded reliability team alongside app and data engineers to monitor, benchmark, and scale ZayZoon’s products.
Responsibilities include developing and maintaining infrastructure-as-code CloudFormation templates, focusing on serverless resources like ECS, Fargate, and Lambda.
The role requires instrumentation and daily metrics analysis of infrastructure performance and Ruby on Rails applications using AWS tools and third-party observability platforms.
The engineer will manage deployment pipelines, including blue/green deployments and intelligent auto-scaling.
Maintaining resource dependencies, particularly for databases, and projecting costs for AWS savings programs are also key responsibilities.
Collaboration with risk and security teams to ensure SOC-2 and cybersecurity compliance is essential.
The engineer will work closely with app developers and data engineers on shared metrics, database performance, and data warehouse development.
Participation in the agile development process, including sprint planning and stand-ups, is expected.
Adherence to secure coding practices and the software development lifecycle (SDLC) is required.
Requirements:
Candidates must have 5+ years of infrastructure experience.
A minimum of 2+ years of AWS experience, including certification and deployment of production applications, is required.
Proficiency with Infrastructure as Code (IaC), specifically CloudFormation, is necessary.
Experience with containerization technologies such as Docker, ECS, and ECR is required.
Candidates should have experience analyzing and addressing performance issues using observability platforms like DataDog, NewRelic, or OTel.
The ability to build quickly for experimentation and cleanly for core functionality is essential.
Strong SQL and data analysis skills, along with a willingness to engage in data-driven problem solving, are required.
Benefits:
Candidates must be located in Canada to be considered for this position.
The role is available on a permanently remote basis, allowing for flexible work arrangements.
Candidates must have access to a secure high-speed internet connection and a secure workspace to protect private information.
As part of the hiring process, reference calls with previous managers and a criminal record check will be conducted.
A basic security clearance will also be required due to the nature of the business.
Apply now
Please, let ZayZoon know you found this job
on RemoteYeah
.
This helps us grow 🌱.