Remote Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)
Posted
Apply now
Please, let Lavendo know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
Our client is a publicly traded company at the forefront of the AI revolution, offering an AI-centric cloud platform that is reshaping the landscape of artificial intelligence.
The company provides cutting-edge infrastructure, including large-scale GPU clusters, cloud platforms, tools, and services for developers to service the explosive growth of the global AI industry for Fortune 1000 companies, top-tier innovative startups, and AI researchers.
We are seeking a Senior AI/ML Specialist Solutions Architect to join our client's team.
This role offers the chance to design and implement scalable AI solutions for AI-focused customers, working with state-of-the-art technologies and contributing to one of the most powerful commercially available supercomputers.
Responsibilities include architecting and optimizing distributed training and inference systems for large-scale AI models, designing customer-focused solutions, leading the transition of ML pipelines from POC to production, building long-term customer relationships, creating whitepapers, delivering technical presentations, providing technical leadership, and collaborating with engineering and product teams.
Requirements:
Candidates must have 5+ years of experience with cloud technologies and infrastructure, ideally in senior MLOps or Solutions Architect roles.
Proven expertise in scaling and optimizing AI workloads across multi-node and multi-GPU environments is required.
Demonstrated success in delivering ML products, scaling from POC to production, is essential.
Deep knowledge of ML frameworks like PyTorch and JAX is necessary.
A strong background in the NVIDIA HPC ecosystem (CUDA, NCCL, Infiniband) is required.
Active involvement in the ML community, including public speaking, open-source contributions, and competitions like Kaggle and Hackathons, is preferred.
Exceptional communication skills to engage both technical teams and business stakeholders are a must.
Legal authorization to work in the United States on a full-time basis without sponsorship is required.
Benefits:
The position offers competitive compensation ranging from $180,000 to $300,000 per year, negotiable based on experience and location.
Full medical benefits include 100% company-paid medical, dental, and vision coverage for employees and their families.
A 401(k) plan with a 4% match program is provided.
Employees are eligible for a stock options plan.
The company offers a flexible remote work environment.
Company-paid short-term, long-term disability, and life insurance coverage are included.
Paid parental leave is available, with 20 weeks for primary caregivers and 12 weeks for secondary caregivers.
Up to $85 per month is provided for mobile and internet expenses.
Employees will work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs.
The opportunity to be part of a team that operates one of the most powerful commercially available supercomputers is available.
Employees will contribute to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings.
Apply now
Please, let Lavendo know you found this job
on RemoteYeah
.
This helps us grow π±.