Define and own the end-to-end AI evaluation architecture across speech, NLP, and GenAI platforms
Job Summary
Define and own the end-to-end AI evaluation architecture across speech, NLP, and GenAI platforms.
Design and lead implementation of a scalable AI testing platform that includes offline evaluation pipelines, golden dataset-driven regression systems, synthetic data generation frameworks, and online A/B testing & shadow deployment strategies.
Serve as the principal authority on AI testing and evaluation strategy, influencing architecture decisions and mentoring senior engineers.
Matching Summary
Define and own the end-to-end AI evaluation architecture across speech, NLP, and GenAI platforms.