Remote Senior Software Engineer, Data

at Jobgether

Posted 1 day ago 3 applied

Description:

  • The position is for a Senior Software Engineer, Data at the Allen Institute for AI (AI2) located in Washington, USA.
  • The role focuses on advancing scientific discovery through large-scale data engineering.
  • Responsibilities include designing and building sophisticated data pipelines and machine learning-powered services that integrate vast patent and academic datasets.
  • The ideal candidate thrives in dynamic, collaborative environments and is passionate about data quality, scalability, and long-term maintainability.
  • The engineer will work with impactful tools used by millions of researchers worldwide, contributing directly to the future of open science and innovation.
  • Key accountabilities include building and maintaining scalable data pipelines using Airflow, developing and deploying lightweight machine learning models, training topic models, extending REST APIs, creating dashboards for data quality evaluation, collaborating with other engineers, and contributing to architecture discussions.

Requirements:

  • A Bachelor’s degree and 8+ years of relevant technical experience or an equivalent combination is required.
  • Expertise in Python for data engineering, including pipeline development and automation is necessary.
  • Proficiency in SQL and production-grade schema design, preferably with PostgreSQL, is required.
  • Hands-on experience with ML pipelines, including training, fine-tuning, and inference for structured data is essential.
  • Strong familiarity with structured data formats such as JSON, XML, and Parquet, as well as ETL practices is needed.
  • Experience with Airflow or similar workflow orchestration tools, AWS, and container technologies like Docker is required.
  • A strong ownership mindset and excellent communication skills are necessary.
  • Bonus experience includes entity resolution, author disambiguation, vector similarity techniques, scholarly datasets, and building internal APIs and dashboards for ML or data QA.

Benefits:

  • The base salary range is $146,880 – $220,320, with additional performance-based annual bonuses.
  • Comprehensive medical, dental, and vision insurance is provided for employees and their families.
  • Flexible spending accounts (FSA), HSA, and HRA plans are available.
  • A 401(k) retirement plan with employer contributions is offered.
  • Monthly stipends of $125 for internet/commuting and $200 for fitness/wellbeing are included.
  • A generous PTO policy includes up to 20 vacation days, 7 personal days, 10 sick days, and 12 paid holidays annually.
  • Remote work flexibility is available within the U.S.
  • The work environment emphasizes work-life balance, inclusion, and personal growth.