Remote Infrastructure Talent Pool (Storage Engineer, Site Reliability Engineer, MLOps)

Posted

Apply now
Please, let Cohere know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • Cohere is seeking talent for various infrastructure roles including Storage Engineer, Site Reliability Engineer, and MLOps in locations such as Toronto, San Francisco, Seattle, London, Canada, New York, United States, and Remote UK.
  • The organization's mission is to scale intelligence to serve humanity by training and deploying frontier models for AI systems.
  • The Infrastructure Team at Cohere focuses on building world-class infrastructure critical to the company's success, emphasizing stability, scalability, and observability.
  • The team optimizes for a wide range of technical skillsets and values self-direction, adaptability, and problem-solving abilities.
  • All infrastructure roles require participation in a 24x7 on-call rotation, with compensation provided.
  • Cohere offers a remote-friendly environment and strategically distributes teams based on interests, expertise, and time zones to promote collaboration and flexibility.

Requirements:

  • 5+ years of engineering experience running production infrastructure at a large scale.
  • Experience working with and supporting MLEs or data scientists.
  • Experience designing large, highly available distributed systems with Kubernetes and GPU workloads.
  • Experience in designing, deploying, supporting, and troubleshooting in complex Linux-based distributed computing environments.
  • For Storage Engineer role: experience synchronizing data between different cloud providers, working with distributed filesystems, and building internal tooling for data engineers.
  • For Analytics & Observability Engineer role: experience running analysis for technical teams, designing dashboards and reports, and using systems like Grafana, Prometheus, BigQuery, and Looker.

Benefits:

  • An open and inclusive culture and work environment.
  • Opportunity to work closely with a team on the cutting edge of AI research.
  • Weekly lunch stipend, in-office lunches, and snacks provided.
  • Full health and dental benefits, including a separate budget for mental health care.
  • 100% Parental Leave top-up for 6 months for employees in Canada, the US, and the UK.
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement.
  • Remote-flexible work environment with offices in Toronto, New York, San Francisco, and London, along with a co-working stipend.
  • 6 weeks of vacation offered.
Apply now
Please, let Cohere know you found this job on RemoteYeah . This helps us grow 🌱.
About the job
Report this job

Job expired or something else is wrong with this job?

Report this job
Leave a feedback