Lead Site Reliability Engineer - Remote

at Sprinto

Posted 5 days ago

Description:

  • Sprinto is a leading platform that automates information security compliance, helping businesses maintain healthy operational practices so they can grow and scale confidently.
  • The company has over 200 employees and serves more than 1000 customers across 75 countries.
  • Sprinto has raised USD 32 million in funding, including a Series B round, from top investment partners such as Accel, ELEVATION, and Blume Ventures.
  • As a Lead Site Reliability Engineer, you will take ownership of the observability pipeline, CI/CD pipeline development, and full infrastructure management to ensure high availability, scalability, and reliable product delivery.
  • You will collaborate with application engineers to develop the tooling needed for efficient operations.
  • You will also establish the team's incident response processes.

Requirements:

  • You must be proficient with Infrastructure as Code (IaC) tools, specifically Terraform and Ansible.
  • Experience with Application Performance Monitoring (APM) tools is required, including setting up on-call practices and identifying bottlenecks across the stack.
  • Proven experience in application capacity planning and in owning incident response workflows is necessary, including conducting Root Cause Analyses (RCAs) and maintaining runbooks.
  • Strong problem-solving abilities and excellent communication skills, both spoken and written, are essential.
  • Familiarity with the current tech stack, which includes Node.js, React, Apollo GraphQL, PostgreSQL, and AWS, is a plus but not mandatory.

Benefits:

  • The position is remote-first, allowing for flexible work arrangements.
  • Employees will work five days a week with flexible hours.
  • Group medical insurance is provided for employees and their families, including parents, spouses, and children.
  • Group accident cover is provided for added security.
  • The company sponsors devices for employees to ensure they have the necessary tools for their work.
  • An education reimbursement policy is in place to support continuous learning and development.