The role involves designing and maintaining comprehensive evaluation tooling to assess AI model performance across chatbots and voice bots
Job Summary
The role involves designing and maintaining comprehensive evaluation tooling to assess AI model performance across chatbots and voice bots.
Genesys employs over 6,000 people globally who embrace empathy and cultivate collaboration to succeed in creating the future of customer experience.
Candidates must balance multiple concurrent projects across diverse languages while maintaining clear visibility on timelines and communicating risks to stakeholders.
Matching Summary
The role involves designing and maintaining comprehensive evaluation tooling to assess AI model performance across chatbots and voice bots.
Skills & Requirements
Must-have
Python pandas NumPy scikit-learn proficiency
Statistical and error analysis for AI models
Data curation and benchmarking frameworks
Bachelor's degree in Computational Linguistics
Native or bilingual English proficiency
Nice-to-have
Master's degree in related field
Experience with LLMs in computational linguistics
Knowledge of additional commercial languages
Familiarity with CI/CD workflows
Agile environment experience
Key Requirements
1–2 years' experience in data curation and evaluation methodologies
Strong skills in statistical and error analysis
Proficiency in Python libraries for data manipulation