Remote Site Reliability Engineer

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • Beacon Biosignals is seeking a skilled Site Reliability Engineer to join their Platform team.
  • The role involves ensuring the reliability, availability, and security of Beacon's cloud infrastructure that supports large-scale machine learning on terabytes of biosignal data.
  • Responsibilities include building and maintaining critical systems, such as Kubernetes clusters and observability infrastructure.
  • The engineer will design and implement infrastructure as code solutions to improve reliability, security, and maintainability.
  • The position requires leading major infrastructure initiatives, developing and maintaining CI/CD pipelines, and improving system observability.
  • Participation in an on-call rotation and leading incident response efforts is expected.
  • Collaboration with development teams to enhance application reliability and performance is essential.
  • The engineer will maintain and enhance security posture through infrastructure hardening and automation.
  • Documentation for infrastructure, deployment processes, and incident response procedures must be created and maintained.

Requirements:

  • Strong experience with Kubernetes administration, including cluster management, security, and troubleshooting is required.
  • A proven track record of implementing infrastructure as code using Terraform or similar tools is necessary.
  • Experience in building and maintaining CI/CD pipelines, particularly with GitHub Actions and ArgoCD, is essential.
  • A solid understanding of container technologies and build processes, especially Docker, is required.
  • Strong knowledge of cloud providers (e.g., AWS) including networking, security, and infrastructure services is needed.
  • Experience with incident response and on-call responsibilities in a production environment is necessary.
  • Deep experience with Linux systems administration and debugging is required.
  • Proficiency in at least one programming language (Python, Go, Typescript, etc.) is essential.
  • Understanding of security and networking concepts including OAuth2/OIDC, DNS, TLS, TCP/UDP, etc., is required.
  • An approximate experience of a Bachelor's degree plus 5-8 years in SRE, DevOps, or similar professional experience is necessary.

Benefits:

  • The base salary range for this role is determined based on past experience, specific skills, and qualifications.
  • The total compensation package includes equity, PTO, and other benefits.
  • Beacon offers a robust asynchronous work environment ensuring a first-class remote work experience.
  • In-person office hubs are available in Boston, New York, and Paris for those who prefer a physical workspace.
Leave a feedback