Please, let Sayari know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
Sayari is seeking a Data Engineering Intern to join its Data Engineering team.
The intern will work with the Product and Software Engineering teams to collect data globally, maintain existing ETL pipelines, and develop new pipelines for Sayari Graph.
Sayari Graph provides instant access to structured business information from billions of corporate, legal, and trade records.
The application tier is built primarily in TypeScript, running in Kubernetes, and is backed by Postgres, Cassandra, Elasticsearch, and Memgraph.
The data ingest tier operates on Spark, processing terabytes of data from hundreds of sources.
The platform allows users to explore a large knowledge graph sourced from hundreds of millions of records in over 200 countries and 30 languages.
The intern will have the opportunity to contribute to open-source projects, including the WebGL-powered network visualization library Trellis.
This is a remote paid internship with work expectations of 20-30 hours per week.
Requirements:
Candidates must have experience with Python and/or a JVM language such as Scala.
Experience working collaboratively with git is required.
Desired skills include experience with Apache Spark and Apache Airflow.
Familiarity with cloud platforms like GCP, AWS, or Azure is preferred.
An understanding of or interest in knowledge graphs is also desired.
Benefits:
Sayari offers a collaborative and positive culture where team members are smart and driven.
There are limitless growth and learning opportunities available.
The company has a strong commitment to diversity, equity, and inclusion.
Team building events and opportunities are provided.
The pay for this internship ranges from $20 to $25 an hour.
Apply now
Please, let Sayari know you found this job
on RemoteYeah
.
This helps us grow π±.