Job Summary
AI Tech is DNB’s new division within Technology & Services, created to accelerate our shift from AI experimentation to real, measurable impact.
You will design evaluation frameworks for agentic systems, ensuring quality, coverage, safety, efficiency, and regulatory compliance.
You’ll work on challenging and meaningful tasks in a strong engineering culture with solid opportunities for professional growth and career development.
Skills & Requirements
Must-have
designing evaluation frameworks for agentic systems
building LLM-as-judge pipelines
integrating evaluation into CI/CD
constructing and maintaining evaluation datasets
proficiency in Python for experiments
experience with agentic evaluation techniques
Nice-to-have
evidence-driven mindset
role model for rigorous AI testing
communicate and collaborate effectively
take true ownership
Key Requirements
Relevant background in evaluating AI/ML systems
Knowledge of observability, logging, and dataset curation
Experience running structured experiments
Familiarity with AI tooling for developer productivity