Please let Halo Media know you found this job on RemoteYeah.
This helps us grow 🌱.
Description:
Halo is an international agency with over 200 people specializing in interactive media strategy and development, focusing on innovation by inclusion to solve digital problems.
The company values equity and empowerment, fostering deep, meaningful relationships with clients.
As a Data Scientist, you will be part of a multidisciplinary team applying advanced analytics, machine learning, and generative AI to solve real-world problems across various sectors including consulting, health, wealth, and career businesses.
You will collaborate closely with engineering, product, and business stakeholders to develop scalable models, design intelligent pipelines, and influence data-driven decision-making across the enterprise.
Requirements:
You must design, develop, and deploy robust machine learning models and data pipelines that support AI-enabled applications.
You should apply exploratory data analysis (EDA) and feature engineering techniques to extract insights and improve model performance.
Collaboration with cross-functional teams to translate business problems into analytical use cases is essential.
You will contribute to the full machine learning lifecycle, from data preparation and model experimentation to deployment and monitoring.
Experience working with structured and unstructured data, including text, to develop NLP and generative AI solutions is required.
You must define and enforce best practices in model validation, reproducibility, documentation, and versioning.
Partnering with engineering to integrate models into production systems using CI/CD pipelines and cloud-native services is necessary.
Staying current with industry trends, emerging techniques (e.g., RAG, LLMs, embeddings), and relevant tools is expected.
A minimum of 3 years of experience in Data Science, Machine Learning, or Applied AI roles is required.
Proficiency in Python (preferred) and a strong grasp of pandas, NumPy, and scikit-learn are necessary.
You should be skilled in data querying, manipulation, and pipeline development using SQL and modern ETL frameworks.
Experience working with Databricks, including notebooks, MLflow, Delta Lake, and job orchestration is required.
Familiarity with Git-based workflows and Agile methodologies is essential.
Strong analytical thinking, problem-solving skills, and communication abilities are necessary.
Exposure to Generative AI, LLMs, prompt engineering, or vector-based search is preferred.
Hands-on experience with cloud platforms (AWS, Azure, or GCP) and deploying models in scalable environments is required.
Knowledge of data versioning, model registry, and ML lifecycle tools (e.g., MLflow, DVC, SageMaker, Databricks, or Vertex AI) is necessary.
Experience working with visualization tools like Tableau, Power BI, or Qlik is required.
A degree in Computer Science, Data Science, Applied Mathematics, or a related field is necessary.
Benefits:
The position offers 100% remote work.
Salary will be provided in USD.
You will have the opportunity to work on challenging projects for the U.S.