Remote Senior AI Infrastructure Engineer (Databricks, AWS, Python)

at Solvd

Posted 1 day ago 1 applied

Description:

  • Solvd is an AI-first advisory and digital engineering firm focused on delivering measurable business impact through strategic digital transformation.
  • The company aims to bridge the gap between experimentation and real ROI by integrating artificial intelligence into various processes.
  • Solvd is seeking a Senior AI Infrastructure Engineer to design, build, and scale AI and data infrastructure.
  • The role involves architecting and maintaining cloud-based MLOps pipelines for scalable, reliable, and production-grade AI/ML workflows.
  • The engineer will collaborate closely with AI engineers, data engineers, and platform teams.
  • The position requires expertise in building and operating modern cloud-native infrastructure to enable world-class AI capabilities.

Requirements:

  • Candidates must have 7+ years of professional experience in software engineering and infrastructure engineering.
  • Extensive experience in building and maintaining AI/ML infrastructure in production, including model, deployment, and lifecycle management is required.
  • Strong knowledge of AWS and infrastructure-as-code frameworks, ideally with CDK, is necessary.
  • Expert-level coding skills in TypeScript and Python for building robust APIs and backend services are essential.
  • Production-level experience with Databricks MLFlow, including model registration, versioning, asset bundles, and model serving workflows is mandatory.
  • Candidates should have an expert-level understanding of containerization (Docker) and hands-on experience with CI/CD pipelines and orchestration tools (e.g., ECS) is a plus.
  • Proven ability to design reliable, secure, and scalable infrastructure for both real-time and batch ML workloads is required.
  • The ability to articulate ideas clearly, present findings persuasively, and build rapport with clients and team members is essential.
  • Strong collaboration skills and the ability to partner effectively with cross-functional teams are necessary.
  • Familiarity with emerging LLM frameworks such as DSPy for advanced prompt orchestration and programmatic LLM pipelines is preferred.
  • Understanding of LLM cost monitoring, latency optimization, and usage analytics in production environments is required.
  • Knowledge of vector databases/embeddings stores (e.g., OpenSearch) to support semantic search and RAG is necessary.

Benefits:

  • Employees will be part of a global team with offices in the USA, Poland, Ukraine, Georgia, and LATAM.
  • The company promotes an AI-first approach, empowering passionate individuals to thrive in the era of AI.
  • Solvd maintains rigorous ethical AI standards, ensuring a responsible work environment.
  • The role offers the opportunity to work on cutting-edge AI technologies and contribute to significant digital transformation projects.