Welcome to RemoteYeah 2.0! Find out more about the new version here.

Remote Sr Backend Data Developer - Scraping and Data Extraction Automation Specialist

at Workana

Posted 11 hours ago 0 applied

Description:

  • Workana is seeking a Senior Python Developer with strong expertise in scraping to join the Data Science team of a leading technology company that drives modern logistics commerce.
  • The ideal candidate should have a technical mindset focused on execution, with a strong ability to organize code cleanly and efficiently, and solve problems in unstructured environments.
  • Candidates should be able to approach current challenges with alternative solutions, not only maintaining existing scrapers but also proposing more scalable, robust, and innovative solutions.
  • The work will involve hundreds of different portals with varying levels of complexity, requiring not only scraping skills but also a deep understanding of the business, advanced logical thinking, and resilience in code design.
  • Main responsibilities include developing robust scrapers for various dynamic data sources, optimizing scraping processes using techniques to evade blocks, participating in the construction of data ingestion and processing pipelines, integrating automation into CI/CD flows, deploying solutions in the cloud, documenting code clearly, and collaborating with other technical team members.

Requirements:

  • A minimum of 5 years of proven experience developing backend solutions with Python, including knowledge of structures, asynchronous programming, concurrency, and small-scale distributed processing.
  • At least 3 years of experience developing complex scrapers, ideally with experience in mass automation, evasion of blocks, and maintenance of multiple sources.
  • Practical experience in developing scrapers using Playwright, Selenium, Scrapy, Requests, or similar libraries.
  • Knowledge of evasion techniques for blocks, including proxy rotation, captchas, and user-agents.
  • Familiarity with CI/CD pipelines, such as GitHub Actions or Jenkins.
  • Experience in cloud deployment, ideally with AWS or GCP.
  • Proficient in SQL databases, such as PostgreSQL or MySQL.
  • Understanding of data structures and best documentation practices.
  • An innovative and problem-solving mindset, with a preference for candidates who have experience with AI tools and think about concurrency, performance, and efficiency.
  • Potential for leadership, with the ability to assume a technical leadership role over time.

Benefits:

  • The position is 100% remote, allowing for flexible work arrangements.
  • It is a full-time role with a contract duration of 6 months, with the possibility of extension based on performance.
  • The salary ranges from 3,000 to 3,500 USD per month, providing competitive compensation for the role.