Please let Clutch know you found this job on RemoteYeah. This helps us get more companies to post jobs here for you.
Description:
Senior ML Engineer responsible for production ML and AI agent systems.
Build and maintain low-latency ML API for Next Best Action (NBA) engine and collaborate with HAL team on LLM agents.
Requirements:
7+ years of engineering experience with production ML systems.
Strong Python skills; familiarity with TypeScript is a plus.
Experience in tool-design for LLM consumption and eval discipline for non-deterministic systems.
Knowledge of prompt-shape literacy and tool implementation rigor.
Experience with low-latency production APIs and AWS, Docker, GitHub workflows.
Active use of AI tooling in engineering workflow.
Desired: Production agent observability, cost and latency tradeoff intuition, familiarity with agent runtime frameworks, and prior SaaS/FinTech experience.
Benefits:
Remote work flexibility.
Biannual off-site team bonding events.
20 PTO days and national holidays.
Stock options as part of compensation.
Budget for home office setup and work-related trips.