This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link was detected as broken.
Description:
The Principal Site Reliability Engineer will enhance the company’s SaaS infrastructure security protocols.
This role involves collaborating across the organization to design, build, and operationalize SaaS services that conform to various security standards such as FedRAMP, SOC2, and ISO.
The engineer will participate in architecture, security, and operations reviews.
They will lead design reviews and the buildout of secure systems for delivering various SaaS services with a target uptime of 99.99%.
Responsibilities include designing, automating, testing, and monitoring the use of cloud-native technologies as a foundation for a service platform.
The engineer will investigate and resolve customer and operational issues, with a focus on fixing issues rather than just mitigating them.
They will identify and automate the measurement of operations SLAs and SLOs.
The role includes triaging incident responses, documenting SOPs and Runbooks, and training NOC team members.
Writing automation that can be easily supported and extended by others is also a key responsibility.
The engineer will work on special projects as assigned.
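To make the 99.99% uptime target and the SLA/SLO measurement work more concrete, here is a minimal sketch of the underlying arithmetic. It is an illustration only: the posting does not say how availability is measured, so the request-based measure and the function names `downtime_budget` and `error_budget_remaining` are assumptions made for this example.

```python
from datetime import timedelta


def downtime_budget(target: float, window: timedelta) -> timedelta:
    """Maximum downtime allowed in `window` at availability `target` (e.g. 0.9999)."""
    if not 0.0 < target <= 1.0:
        raise ValueError("target must be in (0, 1]")
    return window * (1.0 - target)


def error_budget_remaining(target: float, good: int, total: int) -> float:
    """Fraction of the error budget still unspent (negative once the SLO is missed).

    Assumes a request-based measure: `good` successful requests out of `total`.
    """
    if not 0.0 < target < 1.0:
        raise ValueError("target must be in (0, 1)")
    if total <= 0 or not 0 <= good <= total:
        raise ValueError("need 0 <= good <= total and total > 0")
    allowed_bad = (1.0 - target) * total  # failures the SLO tolerates
    actual_bad = total - good             # failures actually observed
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    # Hypothetical numbers: a 30-day window and one million requests.
    print(downtime_budget(0.9999, timedelta(days=30)))
    print(error_budget_remaining(0.9999, good=999_950, total=1_000_000))
```

At 99.99%, a 30-day window leaves roughly 4 minutes 19 seconds of allowable downtime, which is why a role like this emphasizes automation and permanent fixes over manual mitigation.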
Requirements:
Candidates must be U.S. Citizens.
Seven to ten years of site reliability engineering or cloud operations experience, or equivalent experience, is required.
A proven track record of operating production SaaS environments within security standards such as FedRAMP, SOC2, ISO, and PCI is essential.
A Bachelor’s or Master’s degree in Computer Science, Information Systems, or a similar field is required.
Candidates should be skilled at problem-solving, algorithms, and data structures, and able to apply them in ways that conform to modern SaaS security requirements.
Experience in building tools and scripting frameworks from scratch is necessary.
Proficiency with Cloud Automation tools like CloudFormation, Terraform, CDK, and aws-cli is required.
Candidates should be familiar with scripting languages such as Python, Groovy, PowerShell, Bash, and Perl.
Exposure to Windows and Linux administration is necessary.
Familiarity with basic networking, security, and cloud engineering concepts is required.
Candidates must be highly collaborative with effective written and verbal communication skills.
The ability to work against tight deadlines, and occasionally after hours as part of an on-call schedule, is necessary.
Candidates should be willing to take full responsibility for the availability and performance of the platform.
Benefits:
The position offers a remote-first culture, allowing employees to work from home or come into the office as they prefer.
Comprehensive medical, dental, and vision plans are provided.
A 401(k) plan with employer match is available.
Flexible Time Off (FTO), a flexible paid time off policy, is offered to allow employees to take the time they need to re-energize.
Employees can take two days off per calendar year for Volunteer Time Off (VTO) to volunteer with their preferred charitable organization.
A 5-year Service Milestone Sabbatical is included.
Paid parental leave is provided.
There is a generous employee referral bonus program.
Pet insurance is available.
The HQ office is centrally located in Reston Town Center and features a well-stocked kitchen with rotating snacks and beverages, along with catered lunch on Thursdays.
Regular virtual company-wide events, including cooking classes, yoga, meditation, and more, are organized.
Employees have the opportunity to learn from, and develop alongside, some of the best and brightest minds in the industry.