Remote Lead Data Engineer | GenAI

This job is closed

This job post is closed and the position is probably filled. Please do not apply. It was closed automatically after the apply link was detected as broken.

Description:

  • The Lead Data Engineer position at Wellhub involves developing and implementing data cleaning procedures to ensure high-quality data for AI model training.
  • Responsibilities include identifying and rectifying data inconsistencies, errors, and anomalies, as well as performing data normalization, transformation, and augmentation as needed.
  • The role also entails designing and building scalable data pipelines to automate data collection, cleaning, and preprocessing tasks.
  • Collaboration with data engineers to integrate data cleaning processes into the overall data pipeline is essential.
  • Establishing and enforcing data quality standards and best practices, developing and maintaining data validation and verification routines, and monitoring data quality metrics are key aspects of the position.
  • The Lead Data Engineer will also be responsible for developing and maintaining tools and scripts for data cleaning and preprocessing, automating repetitive data cleaning tasks, and staying updated with the latest tools and techniques in data cleaning and preprocessing.
  • Collaboration with data scientists, AI researchers, and other engineers to understand data requirements and challenges, participation in cross-functional team meetings, and effective communication of data quality issues and solutions to stakeholders are crucial responsibilities.
  • Leadership tasks include driving critical projects from architectural design to implementation, planning and successfully delivering cross-team projects, and providing estimates on efforts and risks for high-impact projects.
  • The Lead Data Engineer is expected to live the mission of inspiring and empowering others by caring for their own wellbeing and creating a supportive environment where work-life balance is encouraged.

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field is required.
  • Proven experience in software development with a focus on data processing and data integration is mandatory.
  • Strong programming skills in languages such as Python, SQL, or Java are necessary.
  • Experience with ETL and data processing frameworks and libraries (e.g., Flink, Spark) is essential.
  • Familiarity with machine learning and AI concepts, particularly related to generative AI, is required.
  • Knowledge of data pipeline tools and technologies (e.g., Airflow, EMR, Kafka) is necessary.
  • Excellent problem-solving skills, attention to detail, and strong communication skills in English and Portuguese are mandatory.
  • Experience working with modern agile product development teams, along with leadership skills, is essential.

Benefits:

  • Access to digital fitness programs and online wellness resources for meditation, nutrition, mental health support, and more through the Wellhub platform.
  • Additional fitness subsidy for onsite gyms and fitness studios.
  • Flexible work options including hybrid and fully remote models, with a home office stipend and monthly flexible work allowance.
  • Flexible schedule to adjust working hours based on personal schedule, time zone, and business needs.
  • A minimum of 25 days of paid holiday per year, with additional days based on tenure, plus annual holidays and paid parental leave.
  • Opportunities for personal and career growth with a growth mindset and deep investment in employee development.
  • A supportive and inclusive work environment with a diverse team from around the world, fostering a culture of trust, flexibility, and integrity.