Remote ML Infrastructure Engineer

at VantAI

Posted 1 day ago 2 applied

Description:

  • VantAI is seeking talented engineers to join their core application team, focusing on solving complex scientific problems in biology, chemistry, physics, data science, and computational science.
  • The role involves developing large-scale scientific workflows, supporting scientists on existing workflows, and identifying areas for increased automation to reduce inefficiencies.
  • Projects may include developing cheminformatics tools and algorithms, large-scale virtual screening, protein-protein docking, creating scalable on-demand ML inference infrastructure, and automating modeling, analysis, and visualization of simulations.
  • The tech stack includes languages and frameworks such as Python, Go, Rust, React, and operates in Docker/Kubernetes on GCP.
  • Candidates should possess versatility and a willingness to learn across the tech stack, with a focus on making an impact and solving challenging problems.
  • Outcomes for this role include shaping the product by spearheading new features, developing software tools to improve processes, contributing to company culture, and continuous learning from colleagues.

Requirements:

  • Candidates must have experience working in complex codebases and cloud-native architectures.
  • Proficiency in container orchestration and Kubernetes cluster development is required.
  • Demonstrated experience in creating and managing infrastructure for ML model registry, training, inference, and deployment is essential.
  • A strong familiarity with best practices and the current state of the art for ML infrastructure is necessary.
  • Experience supporting large-scale model architectures with complex environment requirements is required.
  • Candidates should have skills in data architecture and engineering.
  • Demonstrated experience with parallel computing and distributed model architectures is needed.
  • The ability to break apart existing code and solve ad hoc problems independently is essential.
  • A willingness to work with existing codebases is required.
  • Candidates should possess a strong sense of independence and ownership, with no task being too big or small.
  • The ability to provide guidance on implementation considerations while being open to compromise is necessary.
  • Experience in developing/supporting ML models in cheminformatics, bioinformatics, large language, and/or diffusion model domains is desired.
  • Experience working on drug discovery projects is also desired.

Benefits:

  • The salary range for this position in NYC is $120,000 - $190,000, reflecting the job description as written.
  • Candidates seeking a higher salary are encouraged to apply, as the company is open to discussing experience and compensation during the initial contact.