This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after the apply link was detected as broken.
Description:
GoFundMe is a global community focused on helping others, with a mission to provide best-in-class technology for fundraising.
The company has empowered people and organizations to raise over $30 billion since 2010 and aims to become the most helpful place in the world.
The Senior Cloud Ops Engineer will join the Platform Infrastructure and Operations team, focusing on building and maintaining advanced cloud infrastructure for the online fundraising platform.
This role is crucial for ensuring the infrastructure achieves 99.999% availability to support global payments.
Key responsibilities include designing fault-tolerant cloud solutions, fostering a culture of continuous improvement, participating in strategic cloud architecture decisions, enhancing system performance, leading infrastructure resiliency initiatives, driving application resilience, incorporating testing in CI/CD pipelines, and participating in on-call rotations for incident resolution.
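To put the 99.999% availability target above in concrete terms, here is a minimal sketch that converts an availability percentage into a downtime budget. The function name `downtime_budget_minutes` and the choice of a 365-day year and a 30-day month are illustrative assumptions, not anything specified by GoFundMe.

```python
# Hypothetical helper (not from the posting): converts an availability
# target into the amount of downtime it allows over a given period.
def downtime_budget_minutes(availability_percent: float, period_days: float) -> float:
    """Return the downtime, in minutes, permitted over period_days."""
    total_minutes = period_days * 24 * 60
    return total_minutes * (1 - availability_percent / 100)


if __name__ == "__main__":
    # Assumed periods: a 365-day year and a 30-day month.
    for label, days in (("year", 365), ("30-day month", 30)):
        budget = downtime_budget_minutes(99.999, days)
        print(f"99.999% over a {label}: {budget:.2f} minutes")
```

Under these assumptions, "five nines" leaves roughly 5.26 minutes of total downtime per year, or about 26 seconds per 30-day month.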
Requirements:
A Bachelor’s Degree in Computer Science or a related field, or 8+ years of equivalent practical experience is required.
A minimum of 6 years of experience in designing and managing scalable, cloud-based infrastructure, preferably in SaaS environments, is necessary.
Candidates must have deep technical expertise in computer science, strong engineering skills, and a commitment to high-quality solutions.
Expert-level knowledge of AWS cloud services, container technologies like Docker and Kubernetes, and Infrastructure as Code (IaC) tools like Terraform and CloudFormation is essential.
Proficiency in software architecture, including asynchronous event-driven architecture and microservices, is required.
Experience in performance and reliability testing using tools like Artillery, k6, or similar frameworks is necessary.
Candidates should have experience in defining, monitoring, and managing Service Level Indicators (SLIs) and Service Level Objectives (SLOs).
Proven expertise in disaster recovery planning and execution is required.
Hands-on experience with application performance management (APM) tools like New Relic, Datadog, and Splunk is necessary.
Advanced scripting and development skills in Bash, PHP, and Node.js are required.
Candidates should be skilled in managing distributed data systems and troubleshooting complex issues under high load.
Knowledge of compliance regulations and standards, including PCI, SOC 2, and GDPR, is necessary.
Benefits:
Employees will be part of a mission-driven organization that positively impacts millions of lives each year.
The position offers the opportunity to lead in a high-impact product organization and drive business transformation.
Employees will collaborate with a diverse, passionate, and talented team in a fast-paced and innovative environment.
The company promotes a fun, supportive team culture that celebrates accomplishments together.
GoFundMe emphasizes core values such as being impatient to be great, finding a way, earning trust every day, and being fueled by purpose.