Remote Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

Please, let Lavendo know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • Our client is a publicly traded company at the forefront of the AI revolution, offering an AI-centric cloud platform that is reshaping the landscape of artificial intelligence.
  • The company provides cutting-edge infrastructure, including large-scale GPU clusters, cloud platforms, tools, and services for developers, to support the explosive growth of the global AI industry. Its customers include Fortune 1000 companies, top-tier innovative startups, and AI researchers.
  • We are seeking a Senior AI/ML Specialist Solutions Architect to join our client's team.
  • This role offers the chance to design and implement scalable AI solutions for AI-focused customers, working with state-of-the-art technologies and contributing to one of the most powerful commercially available supercomputers.
  • Responsibilities include architecting and optimizing distributed training and inference systems for large-scale AI models, designing customer-focused solutions, leading the transition of ML pipelines from POC to production, building long-term customer relationships, creating whitepapers, delivering technical presentations, providing technical leadership, and collaborating with engineering and product teams.

Requirements:

  • Candidates must have 5+ years of experience with cloud technologies and infrastructure, ideally in senior MLOps or Solutions Architect roles.
  • Proven expertise in scaling and optimizing AI workloads across multi-node and multi-GPU environments is required.
  • Demonstrated success in delivering ML products, scaling from POC to production, is essential.
  • Deep knowledge of ML frameworks like PyTorch and JAX is necessary.
  • A strong background in the NVIDIA HPC ecosystem (CUDA, NCCL, InfiniBand) is required.
  • Active involvement in the ML community, including public speaking, open-source contributions, and competitions such as Kaggle and hackathons, is preferred.
  • Exceptional communication skills to engage both technical teams and business stakeholders are a must.
  • Legal authorization to work in the United States on a full-time basis without sponsorship is required.

Benefits:

  • The position offers competitive compensation ranging from $180,000 to $300,000 per year, negotiable based on experience and location.
  • Full medical benefits include 100% company-paid medical, dental, and vision coverage for employees and their families.
  • A 401(k) plan with a 4% match is provided.
  • Employees are eligible for a stock options plan.
  • The company offers a flexible remote work environment.
  • Company-paid short-term disability, long-term disability, and life insurance coverage are included.
  • Paid parental leave is available, with 20 weeks for primary caregivers and 12 weeks for secondary caregivers.
  • Up to $85 per month is provided for mobile and internet expenses.
  • Employees will work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs.
  • Employees will be part of a team that operates one of the most powerful commercially available supercomputers.
  • Employees will contribute to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings.