Remote Lead Machine Learning Infrastructure Engineer
Posted
Apply now
Please, let Unreal Gigs know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
Lead the machine learning infrastructure initiatives and drive the design, development, and optimization of infrastructure solutions.
Provide technical leadership, mentorship, and guidance to a team of machine learning infrastructure engineers.
Design and optimize data pipelines for data ingestion, preprocessing, and transformation to support machine learning workflows.
Develop infrastructure for training machine learning models at scale using distributed computing frameworks and accelerators.
Lead the design and implementation of systems for deploying and managing machine learning models in production environments.
Implement monitoring and logging solutions to track performance and health of infrastructure and models.
Develop automation tools to streamline machine learning workflows and improve operational efficiency.
Ensure security controls and compliance with data privacy regulations in machine learning infrastructure.
Define best practices, promote documentation, and collaborate with cross-functional teams to deliver infrastructure solutions.
Mentor and coach junior engineers, fostering a culture of continuous learning and improvement within the team.
Requirements:
Bachelor's degree or higher in Computer Science, Engineering, Mathematics, or related field.
8+ years of experience in infrastructure engineering, focusing on machine learning infrastructure.
Proven leadership experience in leading machine learning infrastructure teams and delivering complex projects.
Expertise in cloud platforms like AWS, Azure, or Google Cloud Platform and services such as AWS SageMaker, Azure Machine Learning, or Google AI Platform.
Strong programming skills in Python, Java, or Scala, with experience in distributed computing frameworks like Apache Spark or TensorFlow.
Familiarity with containerization technologies like Docker and container orchestration platforms such as Kubernetes.
Understanding of machine learning concepts, deploying, and managing models in production environments.
Strong problem-solving and analytical skills, with the ability to troubleshoot complex infrastructure issues.
Excellent communication and collaboration skills to work effectively in cross-functional teams.
Benefits:
Competitive salary ranging from $200,000 to $300,000 per year.
Comprehensive benefits package including health insurance, retirement plans, and wellness programs.
Flexible work arrangements with remote work options and flexible hours.
Generous vacation and paid time off.
Professional development opportunities with access to training programs, conferences, and workshops.
State-of-the-art technology environment with cutting-edge tools and resources.
Vibrant and inclusive company culture with growth and advancement opportunities.
Exciting projects with real-world impact at the forefront of AI-driven innovation.
Apply now
Please, let Unreal Gigs know you found this job
on RemoteYeah
.
This helps us grow π±.