Please, let Sardine know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
The Staff Site Reliability Engineer (SRE) will be responsible for keeping all production services running smoothly.
SREs will blend pragmatic operations with software craftsmanship, applying sound engineering principles, operational discipline, and mature automation.
The role involves building and running core components that process billions of events to protect financial institutions from fraud and compliance risks.
SREs will partner with other engineering teams to enhance the performance, scalability, observability, and reliability of their services.
Responsibilities include running infrastructure with Terraform, CI/CD (Github and ArgoCD), and Kubernetes, along with the devops team.
A proactive approach to monitoring is required, focusing on alerting symptoms rather than outages.
Participation in on-call rotations is expected from all engineering team members.
The role involves improving and automating operational processes and enhancing product security.
Debugging production issues across services and levels of the stack is a key responsibility.
SREs will partner with engineering teams to ensure their products meet production standards and will be encouraged to tackle unique issues outside their comfort zone.
The role also includes helping shape the company's engineering culture and maintaining high engineering standards.
Requirements:
The ideal candidate should have 7+ years of experience designing, building, and operating large-scale production systems.
Experience with Google Cloud Platform is required.
Familiarity with monitoring tools like Datadog and preferably open-source tools such as Prometheus, Grafana, and Jaeger (tracing) is necessary.
Elastic search experience is considered a plus.
Candidates should have experience with container orchestration tools like Kubernetes and deployment tools that support Kubernetes, such as ArgoCD and Helm.
Strong programming skills in GoLang and/or other programming languages are essential.
A strong knowledge of database optimization is required.
Good knowledge of ensuring security practices within cloud infrastructure is necessary.
Benefits:
The position offers generous compensation in both cash and equity.
Employees can take advantage of early exercise for all options, including pre-vested options.
The company promotes a remote-first culture, allowing employees to work from anywhere.
Flexible paid time off, including a year-end break and self-care days off, is provided.
Health insurance, dental, and vision coverage is available for employees and their dependents in the US and Canada.
A 4% matching in 401k/RRSP is offered for employees in the US and Canada.
A MacBook Pro will be delivered to the employee's door.
A one-time stipend is provided to set up a home office, including a desk, chair, screen, etc.
Monthly stipends for meals and social meet-ups are included.
An annual health and wellness stipend is provided.
An annual learning stipend is available for professional development.
Employees have unlimited access to an expert financial advisory.
Apply now
Please, let Sardine know you found this job
on RemoteYeah
.
This helps us grow π±.