Remote Senior Cloud Ops Engineer

Posted

This job is closed

This job post is closed and the position is probably filled. Please do not apply.  Automatically closed by a robot after apply link was detected as broken.

Description:

  • GoFundMe is a global community focused on helping others, with a mission to provide best-in-class technology for fundraising.
  • The company has empowered people and organizations to raise over $30 billion since 2010 and aims to become the most helpful place in the world.
  • The Senior Cloud Ops Engineer will join the Platform Infrastructure and Operations team, focusing on building and maintaining advanced cloud infrastructure for the online fundraising platform.
  • This role is crucial for ensuring the infrastructure achieves 99.999% availability to support global payments.
  • Key responsibilities include designing fault-tolerant cloud solutions, fostering a culture of continuous improvement, participating in strategic cloud architecture decisions, enhancing system performance, leading infrastructure resiliency initiatives, driving application resilience, incorporating testing in CI/CD pipelines, and participating in on-call rotations for incident resolution.

Requirements:

  • A Bachelor’s Degree in Computer Science or a related field, or 8+ years of equivalent practical experience is required.
  • A minimum of 6 years of experience in designing and managing scalable, cloud-based infrastructure, preferably in SaaS environments, is necessary.
  • Candidates must have deep technical expertise in computer science, strong engineering skills, and a commitment to high-quality solutions.
  • Expert-level knowledge of AWS cloud services, container technologies like Docker and Kubernetes, and Infrastructure as Code (IaC) tools like Terraform and CloudFormation is essential.
  • Proficiency in software architecture, including asynchronous event-driven architecture and microservices, is required.
  • Experience in performance and reliability testing using tools like Artillery, K6, or similar frameworks is necessary.
  • Candidates should have experience in defining, monitoring, and managing Service Level Indicators (SLIs) and Service Level Objectives (SLOs).
  • Proven expertise in disaster recovery planning and execution is required.
  • Hands-on experience with application performance management (APM) tools like New Relic, DataDog, and Splunk is necessary.
  • Advanced scripting and development skills in Bash, PHP, and NodeJS are required.
  • Candidates should be skilled in managing distributed data systems and troubleshooting complex issues under high load.
  • Knowledge of compliance regulations, including PCI, SOC2, and GDPR, is necessary.

Benefits:

  • Employees will be part of a mission-driven organization that positively impacts millions of lives each year.
  • The position offers the opportunity to lead in a high-impact product organization and drive business transformation.
  • Employees will collaborate with a diverse, passionate, and talented team in a fast-paced and innovative environment.
  • The company promotes a fun, supportive team culture that celebrates accomplishments together.
  • GoFundMe emphasizes core values such as being impatient to be great, finding a way, earning trust every day, and being fueled by purpose.
About the job
Posted on
Job type
Salary
-
Leave a feedback