Remote Senior Site Reliability Engineer

This job is closed

This job post is closed and the position is probably filled. Please do not apply. Automatically closed by a robot after the apply link was detected as broken.

Description:

  • As a Senior Site Reliability Engineer, you will be responsible for managing cloud infrastructure at scale while following best practices.
  • Your top priority is to ensure that production systems maintain four-nines (99.99%) uptime.
  • You will help enhance performance, automation, reliability, and security in our cloud-hosted infrastructure.
  • You will collaborate closely with software engineering teams to educate and assist in releasing stable microservices.
  • The role involves operating systems with mature automation and robust engineering practices.
  • Daily tasks may include hosting workshops on the latest tech tools and building sophisticated backend systems using Infrastructure-as-Code.
  • You will maintain and develop solutions to support CI/CD at scale.
  • You are expected to bring innovative ideas and concepts to the team.
  • You will develop processes and tools for optimizing security.
  • Monitoring, forecasting, and creating capacity plans for services will be part of your responsibilities.
  • You will ensure that backup and recovery systems are functional, tested, and monitored.
  • Utilizing automation for repeatable tasks is essential.
  • You will educate and support engineers on best practices.
  • Participation in a rotating 24x7 on-call group is required.

Requirements:

  • Proficiency in Python, Bash, Groovy, or Go is required.
  • Extensive knowledge and experience in implementing DevOps methodologies are necessary.
  • Strong teamwork skills across Operations and Engineering teams are essential, with the ability to work effectively with Senior Management.
  • A passion for maintaining and writing clear, comprehensive technical documentation is important.
  • Hands-on experience with Google Cloud Platform (GCP) and/or Amazon Web Services (AWS) is required.
  • Experience with Jenkins, GitLab, and Git for continuous integration and source control is necessary.
  • Proficiency in Kubernetes, Helm, or similar orchestration and templating tools is required.
  • Experience with Ansible or Salt for configuration management is necessary.
  • Familiarity with centralized logging systems and monitoring/alerting tools is important.
  • A solid understanding of traditional infrastructure services, including SMTP, DNS, Load Balancing, and TCP/IP Networking, is required.
  • Experience with Terraform, Packer, Consul, and Vault for infrastructure automation and management is necessary.

Benefits:

  • The estimated salary for this role ranges from $132,000 to $190,000 USD per year and may include variable compensation.
  • Actual compensation is based on the candidate's skills, qualifications, and experience.
  • Everbridge offers a wide range of best-in-class, comprehensive, and inclusive employee benefits.
  • Benefits include healthcare, dental, parental planning, and mental health benefits.
  • Disability income benefits, life and AD&D insurance, and a 401(k) plan with a match are provided.
  • Paid time off and fitness reimbursements are also included in the benefits package.