This job post is closed and the position is probably filled. Please do not apply.
Automatically closed by a robot after the apply link was detected as broken.
Description:
The Jr. Data Engineer will be responsible for writing and deploying crawling scripts to collect source data from the web.
They will write and run data transformers in Scala Spark to standardize bulk data sets.
The role involves writing and running modules in Python to parse entity references and relationships from source data.
They will diagnose and fix bugs reported by internal and external users.
They will analyze and report on internal datasets to answer questions and inform feature work.
They will work collaboratively within and across engineering teams using agile principles.
They will give and receive feedback through code reviews.
Requirements:
Professional experience with Python and a JVM language (e.g., Scala).
2+ years of experience designing and maintaining data pipelines.
Experience using Apache Spark and Apache Airflow.
Familiarity with SQL and NoSQL databases (e.g., column stores and graph databases).
Experience working on a cloud platform like GCP, AWS, or Azure.
Proficiency in working collaboratively with Git.
Understanding of Docker/Kubernetes.
Interest in learning from and mentoring team members.
Experience supporting and working with cross-functional teams in a dynamic environment.
Passion for open source development and innovative technology.
Experience with analytics and BI tools like BigQuery and Superset is a plus.
Understanding of knowledge graphs is a plus.
Benefits:
Limitless growth and learning opportunities.
A collaborative and positive culture with smart and driven team members.
Strong commitment to diversity, equity & inclusion.
Generous vacation leave, parental leave, floating holidays, flexible schedule, and other remarkable benefits.
Competitive compensation and commission package.
Comprehensive family-friendly health benefits, including full healthcare coverage plans, commuter benefits, and 401(k) matching.
Equal opportunity employer committed to diversity and inclusion, with no tolerance for discrimination or harassment.