Remote Sr. Deployment Engineer, AI Inference

Posted 6 months ago

Description:

  • Cerebras Systems builds the world's largest AI chip, which is 56 times larger than the largest GPUs and provides the AI compute power of dozens of GPUs on a single chip.
  • The company delivers industry-leading training and inference speeds, allowing machine learning users to run large-scale ML applications without managing multiple GPUs or TPUs.
  • Current customers include global corporations, national labs, and top-tier healthcare systems, with a notable partnership with Mayo Clinic.
  • The role of Sr. Deployment Engineer involves building and operating cutting-edge inference clusters using the Wafer-Scale Engine (WSE).
  • The engineer will ensure reliable, efficient, and scalable deployment of AI inference workloads across global infrastructure.
  • Responsibilities include deploying AI inference replicas, operating across rapidly growing datacenter environments, maximizing capacity allocation, and developing telemetry and observability solutions.
  • The position does not require 24/7 on-call rotations.

Requirements:

  • Candidates should have 5-7 years of experience operating on-prem compute infrastructure, ideally in Machine Learning or High-Performance Computing, or developing and managing complex AWS infrastructure for hybrid deployments.
  • Strong proficiency in Python for automation, orchestration, and deployment tooling is required.
  • A solid understanding of Linux-based systems and command-line tools is necessary.
  • Extensive knowledge of Docker containers and container orchestration platforms such as Kubernetes (K8s) is essential.
  • Familiarity with spine-leaf (Clos) networking architecture is preferred.
  • Proficiency with telemetry and observability stacks such as Prometheus, InfluxDB, and Grafana is required.
  • Candidates should demonstrate a strong ownership mindset and accountability for complex deployments.
  • The ability to work effectively in a fast-paced environment is essential.

Benefits:

  • Employees have the opportunity to build a breakthrough AI platform beyond the constraints of the GPU.
  • Team members can publish and open source their cutting-edge AI research.
  • Employees work on one of the fastest AI supercomputers in the world.
  • The company offers job stability combined with startup vitality.
  • Cerebras promotes a simple, non-corporate work culture that respects individual beliefs.

Job details:

  • Required experience: 5 years
  • Salary: not specified
  • Degree requirement: No degree required
