Remote Site Reliability Engineer

Posted 3 weeks ago

Description:

  • Runpod is a platform for developers to build and run custom AI systems, focusing on reliability and operational excellence.
  • The Reliability team ensures system resilience, observability, and scalability, working cross-functionally with other teams.

Requirements:

  • 5+ years of experience in SRE, Reliability Engineering, or Production Engineering.
  • Strong expertise in Linux systems, networking, and managing containerized production systems.
  • Proven experience in defining SLIs/SLOs, incident response, and postmortem leadership.
  • Strong scripting or programming skills and experience with monitoring systems.
  • Excellent written communication skills.

Benefits:

  • Competitive base pay ranging from $150,000 to $200,000 USD, with equity options.
  • Generous medical, dental, and vision plans.
  • Flexible PTO and remote work options.
  • Opportunity to work in a collaborative, inclusive environment on cutting-edge AI infrastructure.

Required experience

5 years

Salary

$150,000 to $200,000 USD / year

Degree requirement

No degree required
