The role requires designing automated evaluation pipelines for LLM quality and safety that are tied to releases.
Must-have
Nice-to-have
Not specified