Python Developer (AI Evaluation Frameworks)

Michelin Group

Job Summary

  • The role focuses on designing and implementing scalable AI evaluation frameworks to assess LLMs, agents, and GenAI features.
  • Candidates must possess a strong QA mindset to build reproducible evaluation workflows and ensure robust code quality through rigorous testing.
  • The position requires deploying evaluation components on Azure while collaborating closely with data scientists and product stakeholders.

Skills & Requirements

Must-have

  • 5-7 years Python development experience
  • Azure cloud services deployment
  • RESTful API development with FastAPI or Flask
  • GenAI and LLM evaluation concepts
  • Test-driven development and QA practices
  • CI/CD pipeline implementation

Nice-to-have

  • LangChain or LlamaIndex framework experience
  • ML/LLM observability tools knowledge
  • NLP metric design (BLEU/ROUGE)
  • Mentorship and team collaboration skills
  • RAG patterns and agent architectures

Key Requirements

  • 5-7 years professional Python engineering experience
  • Strong knowledge of OOP and software design principles
  • Experience with unit and integration test execution
  • Proficiency in static analysis and SonarQube tooling

Work Rights

Not specified
