Remote Lead Machine Learning Infrastructure Engineer

Posted

Apply now
Please, let Unreal Gigs know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • Lead the machine learning infrastructure initiatives and drive the design, development, and optimization of infrastructure solutions.
  • Provide technical leadership, mentorship, and guidance to a team of machine learning infrastructure engineers.
  • Design and optimize data pipelines for data ingestion, preprocessing, and transformation to support machine learning workflows.
  • Develop infrastructure for training machine learning models at scale using distributed computing frameworks and accelerators.
  • Lead the design and implementation of systems for deploying and managing machine learning models in production environments.
  • Implement monitoring and logging solutions to track performance and health of infrastructure and models.
  • Develop automation tools to streamline machine learning workflows and improve operational efficiency.
  • Ensure security controls and compliance with data privacy regulations in machine learning infrastructure.
  • Define best practices, promote documentation, and collaborate with cross-functional teams to deliver infrastructure solutions.
  • Mentor and coach junior engineers, fostering a culture of continuous learning and improvement within the team.

Requirements:

  • Bachelor's degree or higher in Computer Science, Engineering, Mathematics, or related field.
  • 8+ years of experience in infrastructure engineering, focusing on machine learning infrastructure.
  • Proven leadership experience in leading machine learning infrastructure teams and delivering complex projects.
  • Expertise in cloud platforms like AWS, Azure, or Google Cloud Platform and services such as AWS SageMaker, Azure Machine Learning, or Google AI Platform.
  • Strong programming skills in Python, Java, or Scala, with experience in distributed computing frameworks like Apache Spark or TensorFlow.
  • Familiarity with containerization technologies like Docker and container orchestration platforms such as Kubernetes.
  • Understanding of machine learning concepts, deploying, and managing models in production environments.
  • Strong problem-solving and analytical skills, with the ability to troubleshoot complex infrastructure issues.
  • Excellent communication and collaboration skills to work effectively in cross-functional teams.

Benefits:

  • Competitive salary ranging from $200,000 to $300,000 per year.
  • Comprehensive benefits package including health insurance, retirement plans, and wellness programs.
  • Flexible work arrangements with remote work options and flexible hours.
  • Generous vacation and paid time off.
  • Professional development opportunities with access to training programs, conferences, and workshops.
  • State-of-the-art technology environment with cutting-edge tools and resources.
  • Vibrant and inclusive company culture with growth and advancement opportunities.
  • Exciting projects with real-world impact at the forefront of AI-driven innovation.
Apply now
Please, let Unreal Gigs know you found this job on RemoteYeah . This helps us grow 🌱.
About the job
Posted on
Job type
Salary
$ 200,000 - 300,000 USD / year
Report this job

Job expired or something else is wrong with this job?

Report this job
Leave a feedback