Remote Principal Site Reliability Engineer

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • The Principal Site Reliability Engineer will enhance the company’s SaaS infrastructure security protocols.
  • This role involves collaborating across the organization to design, build, and operationalize SaaS services that conform to various security standards such as FedRAMP, SOC2, and ISO.
  • The engineer will participate in architecture, security, and operations reviews.
  • They will lead design reviews and the buildout of secure systems for delivering various SaaS services with a target uptime of 99.99%.
  • Responsibilities include designing, automating, testing, and monitoring the use of cloud-native technologies as a foundation for a service platform.
  • The engineer will investigate and resolve customer and operational issues with a focus on fixing rather than just mitigating issues.
  • They will identify and automate the measurement of operations SLAs and SLOs.
  • The role includes triaging incident responses, documenting SOPs and Runbooks, and training NOC team members.
  • Writing automation that can be easily supported and extended by others is also a key responsibility.
  • The engineer will work on special projects as assigned.

Requirements:

  • Candidates must be U.S. Citizens.
  • A minimum of 7-10 years of site reliability engineering or cloud operations experience, or equivalent experience, is required.
  • A proven track record of operating production SaaS environments within security standards such as FedRAMP, SOC2, ISO, and PCI is essential.
  • A Bachelor’s or Master’s degree in Computer Science, Information Systems, or a similar field is required.
  • Candidates should be skilled at problem-solving, algorithms, and data structures that conform to modern SaaS security requirements.
  • Experience in building tools and scripting frameworks from scratch is necessary.
  • Proficiency with Cloud Automation tools like CloudFormation, Terraform, CDK, and aws-cli is required.
  • Candidates should be familiar with scripting languages such as Python, Groovy, PowerShell, Bash, and Perl.
  • Exposure to Windows and Linux administration skills is necessary.
  • Familiarity with basic networking, security, and cloud engineering concepts is required.
  • Candidates must be highly collaborative with effective written and verbal communication skills.
  • The ability to work against tight deadlines and occasionally after-hours, as part of an on-call schedule, is necessary.
  • Candidates should be willing to take full responsibility for the availability and performance of the platform.

Benefits:

  • The position offers a remote-first culture, allowing employees to work from home or come into the office as they prefer.
  • Comprehensive medical, dental, and vision plans are provided.
  • A 401(k) plan with employer match is available.
  • Flexible Paid Time Off (FTO) is offered to allow employees to take the time they need to re-energize.
  • Employees can take two days off per calendar year for Volunteer Time Off (VTO) to volunteer with their preferred charitable organization.
  • A 5-year Service Milestone Sabbatical is included.
  • Paid parental leave is provided.
  • There is a generous employee referral bonus program.
  • Pet insurance is available.
  • The HQ office is centrally located in Reston Town Center and features a well-stocked kitchen with rotating snacks and beverages, along with catered lunch on Thursdays.
  • Regular virtual company-wide events, including cooking classes, yoga, meditation, and more, are organized.
  • Employees have the opportunity to learn and develop from some of the best and brightest minds in the industry.
About the job
Posted on
Job type
Salary
-
S
ScienceLogic's company logo
ScienceLogic
View company profile
Leave a feedback