Remote Data Engineer Intern - Web Crawling

Posted

Apply now
Please, let Sayari know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • Sayari is seeking a Data Engineer Intern specializing in web crawling to join its Data Engineering team.
  • The internship involves maintaining and improving Sayari’s web crawling framework, focusing on scalability and reliability.
  • The intern will work with the Product and Software Engineering teams to ensure the crawling deployment meets product requirements and integrates efficiently with ETL pipelines.
  • Responsibilities include investigating and implementing web crawlers for new sources, maintaining and improving existing crawling infrastructure, improving metrics and reporting for web crawling, helping improve and maintain ETL processes, and contributing to the development and design of Sayari’s data product.
  • This is a remote paid internship with work expectations of 20-30 hours a week.

Requirements:

  • Candidates must have experience with Python.
  • Experience managing web crawling at scale is required, with familiarity in any framework; experience with Scrapy is a plus.
  • Candidates should have experience working with Kubernetes.
  • Experience working collaboratively with git is necessary.
  • Candidates must be familiar with selectors such as XPath, CSS, and JMESPath.
  • Experience with WebDev tools like Chrome or Firefox is required.

Benefits:

  • The position offers a competitive hourly wage of $20 - $25.
  • Interns will gain hands-on experience in a dynamic and supportive work environment.
  • The internship provides opportunities for training and learning, fostering professional growth.
  • Interns will work with a high-performing and curious team, enhancing their collaborative skills.
Apply now
Please, let Sayari know you found this job on RemoteYeah . This helps us grow 🌱.
About the job
Posted on
Job type
Salary
$ 20 - 25 USD / hour
Report this job

Job expired or something else is wrong with this job?

Report this job
Leave a feedback