Ai Evaluation Engineer

DNB Bank ASA

Norway
On-site
Designing evaluation frameworks for agentic systems
Building llm-as-judge pipelines
Integrating evaluation into ci/cd
AI Tech is DNB’s new division within Technology & Services, created to accelerate our shift from AI experimentation to real, measurable impact

Job Summary

  • AI Tech is DNB’s new division within Technology & Services, created to accelerate our shift from AI experimentation to real, measurable impact.
  • You will be designing evaluation frameworks for agentic systems, ensuring quality, coverage, safety, efficiency, and regulatory compliance.
  • You’ll work on challenging and meaningful tasks in a strong engineering culture with solid opportunities for professional growth and career development.

Matching Summary

AI Tech is DNB’s new division within Technology & Services, created to accelerate our shift from AI experimentation to real, measurable impact.

Skills & Requirements

Must-have

  • designing evaluation frameworks for agentic systems
  • building LLM-as-judge pipelines
  • integrating evaluation into CI/CD
  • constructing and maintaining evaluation datasets
  • proficiency in Python for experiments
  • experience with agentic evaluation techniques

Nice-to-have

  • evidence driven mindset
  • role model for rigorous AI testing
  • communicate and collaborate effectively
  • take true ownership

Key Requirements

  • Relevant background from evaluating AI / ML systems
  • Knowledge of observability, logging, and dataset curation
  • Experience running structured experiments
  • Familiarity with AI tooling for developer productivity
  • Experience with agentic evaluation solutions

Work Rights

Not specified

Tailored Resume

Cover Letter