This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after the apply link was detected as broken.
Description:
The Data Engineer (DE) will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization.
This role is aligned to the Workforce Acceleration Initiative (WAI), a federally funded CDC Foundation program aimed at helping public health agencies improve their information systems.
The DE will be responsible for designing, developing, and maintaining robust Extract, Transform, and Load (ETL) pipelines and data architectures using Python, Apache Airflow, and other cutting-edge data engineering technologies.
The role involves implementing scalable data storage solutions and automating data workflows.
The DE will enforce data security best practices and troubleshoot data issues to ensure data quality and integrity.
Collaboration with data content experts, analysts, data scientists, data modelers, warehouse architects, IT staff, and other organization staff is essential to design and implement solutions that meet public health agency needs.
The position is remote, but candidates must be based in the United States; preferred work hours are 8 a.m. to 5 p.m. Central Time.
Requirements:
A Bachelor’s degree in computer science, information technology, or a related field (or equivalent experience) is required.
A minimum of 5 years of experience as a Backend Data Engineer or in a similar role, with a strong understanding of ETL processes and data warehousing concepts, is necessary.
Proven experience with Python and related data engineering libraries (e.g., pandas, NumPy, Spark) and hands-on experience with Apache Airflow for managing data pipelines and workflows are essential.
Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, or SQL, is required.
Experience with big data technologies and frameworks like Hadoop, Spark, Kafka, and Flink is necessary.
A strong understanding of database systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra), is required.
Experience with engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review is essential.
Knowledge of data warehousing concepts and tools is required.
Experience with cloud computing platforms is necessary.
Expertise in data modeling, ETL processes, and data integration techniques is required.
Familiarity with agile development methodologies, software design patterns, and best practices is necessary.
Strong analytical thinking and problem-solving abilities are essential.
Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively, are required.
Flexibility to adapt to evolving project requirements and priorities is necessary.
Outstanding interpersonal and teamwork skills, with the ability to develop productive working relationships with colleagues and partners, are essential.
Experience working in a virtual environment with remote partners and teams is required.
Proficiency in Microsoft Office is necessary.
Benefits:
The salary range for this position is $103,500-$143,500 per year, plus benefits, with individual salary offers based on experience and qualifications unique to each candidate.
This position is grant funded and is a limited-term opportunity, with an end date of June 30, 2025.
The role offers a fully remote work arrangement for U.S.-based candidates.
The CDC Foundation provides a collaborative work environment focused on public health initiatives.
All qualified applicants will receive consideration for employment without discrimination based on race, color, religion, sex, national origin, age, mental or physical disability, veteran status, or other protected characteristics.