Description:
As a Senior Site Reliability Engineer, you will be responsible for managing cloud infrastructure at scale while following best practices.
Your top priority is to ensure production systems maintain four-nines (99.99%) uptime.
You will help enhance performance, automation, reliability, and security in our cloud-hosted infrastructure.
You will collaborate closely with software engineering teams, educating them and helping them release stable microservices.
The role involves operating systems with mature automation and robust engineering practices.
Daily tasks may include hosting workshops on the latest tech tools and building sophisticated backend systems using Infrastructure-as-Code.
You will maintain and develop solutions to support CI/CD at scale.
You are expected to bring innovative ideas and concepts to the team.
You will develop processes and tools for optimizing security.
Monitoring, forecasting, and creating capacity plans for services will be part of your responsibilities.
You will ensure that backup and recovery systems are functional, tested, and monitored.
Utilizing automation for repeatable tasks is essential.
You will educate and support engineers on best practices.
Participation in a rotating 24x7 on-call group is required.
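For a sense of scale, the 99.99% uptime target named above can be turned into a downtime budget. The short Python sketch below is illustrative only: the function name and the 30-day and 365-day periods are assumptions chosen for the example, not figures from this posting.

```python
def downtime_budget_minutes(availability_percent: float, period_days: float) -> float:
    """Return the minutes of downtime an availability target permits in a period."""
    period_minutes = period_days * 24 * 60
    return period_minutes * (1 - availability_percent / 100)


if __name__ == "__main__":
    # Hypothetical reporting periods, chosen only for illustration.
    for label, days in [("30-day month", 30), ("365-day year", 365)]:
        budget = downtime_budget_minutes(99.99, days)
        print(f"{label}: {budget:.2f} minutes of allowed downtime")
```

Under these assumptions the budget works out to roughly 4.3 minutes per 30-day month and about 52.6 minutes per year, which is why mature automation and a rotating on-call group matter for the role.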
Requirements:
Proficiency in Python, Bash, Groovy, or Go is required.
Extensive knowledge and experience in implementing DevOps methodologies are necessary.
Strong teamwork skills across Operations and Engineering teams are essential, with the ability to work effectively with Senior Management.
A passion for maintaining and writing clear, comprehensive technical documentation is important.
Hands-on experience with Google Cloud Platform (GCP) and/or Amazon Web Services (AWS) is required.
Experience with Jenkins, GitLab, and Git for continuous integration and source control is necessary.
Proficiency in Kubernetes and Helm, or similar orchestration and templating tools, is required.
Experience with Ansible or Salt for configuration management is necessary.
Familiarity with centralized logging systems and monitoring/alerting tools is important.
A solid understanding of traditional infrastructure services, including SMTP, DNS, Load Balancing, and TCP/IP Networking, is required.
Experience with Terraform, Packer, Consul, and Vault for infrastructure automation and management is necessary.
Benefits:
The estimated salary range for this role is $132,000 to $190,000, and the role may include variable compensation.
Actual compensation is based on the candidate's skills, qualifications, and experience.
Everbridge offers a wide range of best-in-class, comprehensive, and inclusive employee benefits.
Benefits include healthcare, dental, parental planning, and mental health benefits.
Disability income benefits, life and AD&D insurance, and a 401(k) plan with a match are provided.
Paid time off and fitness reimbursements are also included in the benefits package.