The company is publicly traded and offers an AI-centric cloud platform.
It provides infrastructure for developers, including large-scale GPU clusters, cloud platforms, tools, and services, to support the growth of the global AI industry.
The mission is to democratize access to AI infrastructure and empower organizations to create, optimize, and deploy AI solutions at any scale.
The Senior AI/ML Specialist Solutions Architect will design and implement scalable AI solutions for AI-focused customers, working with state-of-the-art technologies.
Responsibilities:
Architecting and optimizing distributed training and inference systems for large-scale AI models.
Designing customer-focused solutions.
Leading the transition of ML pipelines from POC to production.
Building long-term customer relationships.
Creating whitepapers and delivering technical presentations.
Providing technical leadership.
Collaborating with engineering and product teams.
Requirements:
Candidates must have 5+ years of experience with cloud technologies and infrastructure, ideally in senior MLOps or Solutions Architect roles.
Proven expertise in scaling and optimizing AI workloads across multi-node and multi-GPU environments is required.
Demonstrated success in delivering ML products and scaling them from POC to production is essential.
Deep knowledge of ML frameworks like PyTorch and JAX is necessary.
A strong background in the NVIDIA HPC ecosystem (CUDA, NCCL, InfiniBand) is required.
Exceptional communication skills to engage both technical teams and business stakeholders are a must.
Legal authorization to work in the United States on a full-time basis without sponsorship is required.
Benefits:
The position offers competitive compensation ranging from $180,000 to $300,000 per year, negotiable based on experience and location.
Health benefits include 100% company-paid medical, dental, and vision coverage for employees and their families.
A 401(k) plan with a 4% match program is provided.
Employees can participate in a stock options plan.
The company offers a flexible remote work environment.
Company-paid short-term disability, long-term disability, and life insurance coverage is included.
The parental leave policy provides 20 weeks of paid leave for primary caregivers and 12 weeks for secondary caregivers.
Employees receive up to $85/month for mobile and internet expenses.
Employees work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs.
Employees will be part of a team that operates one of the most powerful commercially available supercomputers.
The company contributes to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings.