Description:
Join Cohere as a Machine Learning Engineer specializing in synthetic data to develop and manage the synthetic data pipeline for advanced language models.
Responsibilities include maintaining and optimizing data pipelines, conducting data analysis, and improving model quality through innovative data curation methods.
Requirements:
Strong software engineering skills in Python and experience building data pipelines.
Familiarity with data processing frameworks (e.g., Apache Spark, Pandas) and LLM inference frameworks (e.g., vLLM, TensorRT).
Experience with large-scale datasets and a passion for bridging research and engineering in AI model training.
Bonus: Publications at top-tier venues (NeurIPS, ICML, etc.).
Benefits:
Open and inclusive culture with a focus on collaboration in AI research.
Weekly lunch stipend, full health and dental benefits, and a mental health budget.
100% parental leave top-up for up to 6 months and personal enrichment benefits.
Remote-flexible work, with offices in major cities, and 6 weeks of vacation.