AirGarage is seeking a Software Engineer to manage the reliability, health, and observability of its nationwide IoT device fleet.
The role involves working with embedded systems, backend infrastructure, and site reliability engineering.
Responsibilities include designing and building tools, monitoring pipelines, and automation to ensure hundreds of devices remain online and perform reliably.
The engineer will architect and ship production code, build internal platforms for fleet monitoring and diagnostics, and apply debugging skills to resolve issues.
The technology stack includes Embedded Linux (Debian, Yocto), Python, C++, DataDog, SQL, Postgres, Snowflake, AWS, GCP, Docker, RabbitMQ, and Github Actions.
Key tasks include designing systems for monitoring IoT device health, building internal tools for device setup and QA automation, and investigating fleet-wide issues through metrics and logs.
The engineer will also conduct health assessments, collaborate with operations and hardware teams, and maintain troubleshooting guides.
The position requires occasional travel (approximately 20%) for QA, deployments, and on-site debugging.
Requirements:
Candidates must have 3+ years of professional software engineering experience.
Strong proficiency in Python and SQL is required, with experience in shipping production-quality code; a background in C++ is a plus.
Experience managing distributed Linux-based hardware appliances or IoT fleets is necessary.
Familiarity with observability and monitoring tools such as DataDog, OpenTelemetry, Prometheus, and Grafana is essential.
A track record of building internal tooling, monitoring, or reliability platforms is required.
Hands-on experience with Linux systems and debugging distributed hardware/software environments is needed.
A background in cellular and wireless connectivity technologies is preferred.
Excellent written and verbal communication skills are necessary for translating technical findings into clear reports.
Candidates should be self-starters who thrive in fast-paced, ownership-driven environments and be willing to travel for troubleshooting.
Benefits:
Employees will receive equity in the company, allowing them to have a stake in the business they help build.
The position is fully remote, allowing employees to work from anywhere in North America.
Health insurance is provided, covering 85% of the cost for the primary employee and 50% for dependents.
A home office setup is included, providing a laptop and necessary equipment for success.
The company offers an unlimited PTO policy with a minimum requirement of 10 days per year.
A 401k retirement savings program is available for financial planning.
Team off-sites occur approximately twice a year in various locations.
Employees can participate in a recurring book club for personal and professional growth.
There are opportunities for career growth as the team expands significantly in the coming years.
Employees have the chance to impact urban real estate use and work with a diverse team committed to inclusivity.