Engineering Manager, Agent Prompts & Evals

Anthropic

San Francisco, CA, US
On-site

Job Summary

  • This team owns the infrastructure that lets Anthropic ship model and prompt changes with confidence — the eval frameworks, system prompt pipelines, and regression-detection systems that every model launch depends on.
  • You’ll partner closely with other evals groups across the company on shared infrastructure and methodology, with product teams who are shipping features on top of Claude, and with the TPMs and research PMs driving model launches.
  • We encourage you to apply even if you do not believe you meet every single qualification.

Salary

Not specified

Skills & Requirements

Must-have

  • Lead and grow the engineering team
  • Own the product-side eval platform
  • Own the system prompt infrastructure
  • Build durable collaboration with partner orgs
  • Recruit and retain senior engineers

Nice-to-have

  • Interest in AI safety and alignment
  • Experience with LLM evals
  • Background in devtools at scale
  • Experience managing a team that sits between larger orgs

Key Requirements

  • 8+ years of software engineering experience
  • 3+ years of experience managing engineering teams
  • Experience leading platform/infra/devtools teams
  • Bachelor's degree or equivalent experience

Work Rights

Not specified

Sponsorship: available
