Remote Data Engineer

at Wynd Labs

Posted 4 days ago 11 applied

Description:

  • Wynd Labs is an early-stage startup focused on making public web data accessible for AI through contributions to Grass, a network sharing application.
  • Grass allows users to share their unused bandwidth, creating a residential proxy network that rewards individual residential IPs for the bandwidth they provide.
  • The company operates with a lean, motivated team and a flat organizational structure, emphasizing autonomy and responsibility.
  • The Data Engineer will be responsible for building and maintaining robust data pipelines and integrating scalable infrastructure.
  • This role involves designing and optimizing data systems to ensure seamless data flow and accessibility, directly supporting Grass's mission in data-driven innovation.

Requirements:

  • A Bachelor’s degree in Computer Science, Information Systems, Data Engineering, or a related technical field is required.
  • Extensive experience with database systems such as Redshift, Snowflake, or similar cloud-based solutions is necessary.
  • Advanced proficiency in SQL and experience with optimizing complex queries for performance is essential.
  • Hands-on experience with building and managing data pipelines using tools like Apache Airflow, AWS Glue, or similar technologies is required.
  • A solid understanding of ETL (Extract, Transform, Load) processes and best practices for data integration is needed.
  • Experience with infrastructure automation tools (e.g., Terraform, CloudFormation) for managing data ecosystems is important.
  • Knowledge of programming languages such as Python, Scala, or Java for pipeline orchestration and data manipulation is required.
  • Strong analytical and problem-solving skills, with the ability to troubleshoot and resolve data flow issues, are essential.
  • Familiarity with containerization (e.g., Docker) and orchestration (e.g., Kubernetes) technologies for data infrastructure deployment is necessary.
  • A collaborative team player with strong communication skills to work with cross-functional teams is required.

Benefits:

  • The opportunity to work at the forefront of developing a web-scale crawler and knowledge graph, allowing ordinary people to participate in AI development.
  • A culture that prioritizes low ego and high output, with a lean team working towards ambitious goals of improving access to public web data.
  • Competitive salary and equity package are offered as part of the compensation.