Please, let Masabi know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
Join the Site Reliability Engineering team at Masabi to ensure the reliability, performance, and security of the fare collection platform.
Drive automation to reduce operational overhead and human error by building CI/CD pipelines and developing Infrastructure as Code (IaC) using tools like Terraform and CloudFormation.
Refine processes, tools, and workflows to enhance system reliability, scalability, and efficiency while planning capacity for future needs.
Ensure infrastructure meets organizational security standards and supports compliance frameworks like SOC 2 and PCI.
Maintain real-time monitoring systems aligned with SLIs and SLOs to ensure uptime and performance meet or exceed SLAs.
Monitor and optimize cloud infrastructure costs through autoscaling, rightsizing, and architectural reviews.
Implement failover strategies, disaster recovery plans, and redundancy to ensure system resilience.
Respond to production incidents, minimize downtime, and restore availability while performing root cause analysis and contributing to post-incident reviews.
Collaborate with developers to design reliable systems and coach teams on best practices for reliability and scalability.
Maintain detailed documentation for infrastructure, incident response, and workflows, developing playbooks and runbooks for knowledge transfer.
Requirements:
Significant experience in SRE or related roles with a proven track record in building and maintaining reliable systems.
Expertise in AWS Cloud technologies is essential.
Hands-on experience with Terraform and Grafana, along with strong knowledge of security principles and networking components.
Essential hands-on experience with EKS and ECS.
Experience in building pipelines and robust CI/CD infrastructure.
A collaborative team player who approaches projects with an open mind and prioritizes security.
Passionate about leveraging technology to drive advancements while ensuring reliability and security.
Excellent communication skills, a collaborative mindset, and a willingness to learn and contribute to team success.
Self-sufficient and capable of working independently, while also knowing when to seek support or input.
Familiarity with PCI DSS v4 Compliance requirements is a plus.
AWS Cloud certification is desirable.
Benefits:
Join a network of innovators from diverse backgrounds who are passionate about improving accessibility and making fares fair for everyone.
Work in an environment that celebrates multiple approaches and points of view, empowering individuals to bring their authentic selves to work.
Be part of a mission-driven company that supports transit agencies and enhances the lives of millions of riders.
Enjoy a culture of openness and collaboration, with opportunities for personal and professional growth.
Apply now
Please, let Masabi know you found this job
on RemoteYeah
.
This helps us grow 🌱.