Solvd is an AI-first advisory and digital engineering firm focused on delivering measurable business impact through strategic digital transformation.
The company aims to bridge the gap between experimentation and real ROI by integrating artificial intelligence into various processes.
Solvd is seeking a Senior AI Infrastructure Engineer to design, build, and scale AI and data infrastructure.
The role involves architecting and maintaining cloud-based MLOps pipelines for scalable, reliable, and production-grade AI/ML workflows.
The engineer will collaborate closely with AI engineers, data engineers, and platform teams.
The position requires expertise in building and operating modern cloud-native infrastructure to enable world-class AI capabilities.
Requirements:
Candidates must have 7+ years of professional experience in software engineering and infrastructure engineering.
Extensive experience in building and maintaining AI/ML infrastructure in production, including model, deployment, and lifecycle management is required.
Strong knowledge of AWS and infrastructure-as-code frameworks, ideally with CDK, is necessary.
Expert-level coding skills in TypeScript and Python for building robust APIs and backend services are essential.
Production-level experience with Databricks MLFlow, including model registration, versioning, asset bundles, and model serving workflows is mandatory.
Candidates should have an expert-level understanding of containerization (Docker) and hands-on experience with CI/CD pipelines and orchestration tools (e.g., ECS) is a plus.
Proven ability to design reliable, secure, and scalable infrastructure for both real-time and batch ML workloads is required.
The ability to articulate ideas clearly, present findings persuasively, and build rapport with clients and team members is essential.
Strong collaboration skills and the ability to partner effectively with cross-functional teams are necessary.
Familiarity with emerging LLM frameworks such as DSPy for advanced prompt orchestration and programmatic LLM pipelines is preferred.
Understanding of LLM cost monitoring, latency optimization, and usage analytics in production environments is required.
Knowledge of vector databases/embeddings stores (e.g., OpenSearch) to support semantic search and RAG is necessary.
Benefits:
Employees will be part of a global team with offices in the USA, Poland, Ukraine, Georgia, and LATAM.
The company promotes an AI-first approach, empowering passionate individuals to thrive in the era of AI.
Solvd maintains rigorous ethical AI standards, ensuring a responsible work environment.
The role offers the opportunity to work on cutting-edge AI technologies and contribute to significant digital transformation projects.