Checkmate is building advanced Voice AI systems for major restaurant and retail brands in the US, with AI solutions achieving over 80% accuracy.
The company is scaling to 3,000+ stores by the end of this year, presenting a significant market opportunity.
The role involves developing, testing, and refining prompts for various tasks to optimize Voice AI performance.
Responsibilities include designing evaluation frameworks, analyzing prompt outputs, and identifying improvement opportunities through data-driven analysis.
The position requires conducting experiments to test prompt variations and iterating to enhance accuracy and safety.
The candidate will build regression test suites to ensure compliance and performance as models evolve.
Collaboration with data science, product, legal, engineering, and operations teams is essential to align prompt designs with business goals.
The role includes leading a team of analysts focused on prompt evaluation and data quality analysis, guiding prioritization and reporting.
Continuous learning about emerging prompting techniques and AI safety practices is expected to maintain best-in-class solutions.
Requirements:
Strong analytical and data science skills, with hands-on experience in Python, including libraries such as pandas, NumPy, and scikit-learn.
Experience in designing and conducting experiments and evaluations in applied AI or NLP contexts is required.
Proficiency in SQL and experience working with relational databases like MySQL, PostgreSQL, Oracle, or MS SQL is necessary.
A good understanding of data processing, quality measurement, and testing fundamentals is essential.
Experience in leading analyst or operations teams, with strong prioritization, mentorship, and collaboration skills is required.
A strong problem-solving mindset with a drive to explore, optimize, and automate workflows is necessary.
Excellent communication skills for presenting insights to both technical and non-technical stakeholders are required.
A Bachelor’s degree in Data Science, Computer Science, Statistics, Engineering, or a related field is mandatory.
Flexibility to work US hours until at least 6 p.m. ET, with a strong remote setup is required.
Benefits:
The position offers the opportunity to shape AI products used daily by thousands, driving measurable impact at scale.
Employees will be part of a rapidly growing company in a $1 billion market opportunity.
The role provides a chance to work with advanced technologies and collaborate with cross-functional teams.
Continuous learning and professional development in the field of AI and data science are encouraged.
The company supports a flexible remote work environment, accommodating US working hours.