Wynd Labs is an early-stage startup focused on making public web data accessible for AI through contributions to Grass, a network sharing application.
Grass allows users to share their unused bandwidth, creating a residential proxy network that rewards individual residential IPs for the bandwidth they provide.
The company operates with a lean, motivated team and a flat organizational structure, emphasizing autonomy and responsibility.
The Data Engineer will be responsible for building and maintaining robust data pipelines and integrating scalable infrastructure.
This role involves designing and optimizing data systems to ensure seamless data flow and accessibility, directly supporting Grass's mission in data-driven innovation.
Requirements:
A Bachelor’s degree in Computer Science, Information Systems, Data Engineering, or a related technical field is required.
Extensive experience with database systems such as Redshift, Snowflake, or similar cloud-based solutions is necessary.
Advanced proficiency in SQL and experience with optimizing complex queries for performance is essential.
Hands-on experience with building and managing data pipelines using tools like Apache Airflow, AWS Glue, or similar technologies is required.
A solid understanding of ETL (Extract, Transform, Load) processes and best practices for data integration is needed.
Experience with infrastructure automation tools (e.g., Terraform, CloudFormation) for managing data ecosystems is important.
Knowledge of programming languages such as Python, Scala, or Java for pipeline orchestration and data manipulation is required.
Strong analytical and problem-solving skills, with the ability to troubleshoot and resolve data flow issues, are essential.
Familiarity with containerization (e.g., Docker) and orchestration (e.g., Kubernetes) technologies for data infrastructure deployment is necessary.
A collaborative team player with strong communication skills to work with cross-functional teams is required.
Benefits:
The opportunity to work at the forefront of developing a web-scale crawler and knowledge graph, allowing ordinary people to participate in AI development.
A culture that prioritizes low ego and high output, with a lean team working towards ambitious goals of improving access to public web data.
Competitive salary and equity package are offered as part of the compensation.