Remote Senior Site Reliability Engineer (REMOTE) at Weedmaps

Description:

As a Senior Site Reliability Engineer at Weedmaps, you will work cross-departmentally with partners on the application, infrastructure, and quality teams to enhance the performance, reliability, resilience, and scalability of Weedmaps.com web services.
The organization is cloud-native, with 100% of services in Docker running on Kubernetes in AWS’ public cloud.
Your daily focus will include leveraging engineering skills to assist in building, monitoring, reducing developer toil, configuring CI workflows, and improving deployment pipelines.
You will serve as a knowledge reference for development teams to ensure consistent tooling for metrics, logging, build, and deployment.
Collaborate with development and infrastructure teams to identify essential service-specific metrics that need monitoring and work with application development teams to create libraries for easy service instrumentation.
Responsibilities include troubleshooting deployment issues in the CI/CD pipeline, advocating for the DevOps culture, identifying areas for automation, and creating synthetic monitoring flows.
You will help teams understand the reliability of their services using metrics and observability.

A minimum of 5 years of experience at startup or mid-sized companies is required.
Proficiency in at least one programming language such as Python, Go, Node, Ruby, or Elixir is necessary.
Experience using and operating Kubernetes in a production environment is essential.
Effective communication skills, a positive attitude, and the ability to give and receive constructive feedback are required.
The ability to learn quickly and adapt to changing environments is a must.
A strong bias for action and decision-making capabilities are essential.
Self-management skills, including prioritization and time management, are required.
Professional experience with cloud-native observability standards such as Open Metrics, Open Tracing, and Open Census is necessary.
Expertise in using and configuring modern CI/CD workflows is required.
An intimate understanding and experience implementing SLIs, SLOs, and SLAs from the service level to the business level is essential.
A deep understanding of the GitHub branching strategy is required.
Experience troubleshooting containerized applications is necessary.
Familiarity with Infrastructure as Code, automation, and configuration is required.

Physical health benefits include medical, dental, and vision coverage with 100% employer-paid premiums and company contributions to a Health Savings Account for those electing the High Deductible Health Plan.
Mental health benefits include free access to the CALM app for employees and dependents, employee training, and mental health seminars and Q&A sessions.
Basic life and AD&D insurance is employer-paid at 1x salary up to $250,000.
A 401(k) retirement plan with employer match contributions is offered.
Generous PTO, paid sick leave, and company holidays are provided.
Supplemental voluntary benefits, including student loan repayment and 529 education savings with company contributions, are available.
Flexible spending accounts (FSA) for medical, dependent, transit, and parking expenses are offered.
Additional benefits include voluntary life and AD&D insurance, critical illness insurance, accident insurance, short- and long-term disability insurance, pet insurance, family planning/fertility support, identity theft protection, and legal access to a network of attorneys.
Paid parental leave is also provided.