Research Engineer, Production Model Post-training

Anthropic

San Francisco, CA, US
$350,000 — $500,000 usd py
On-site
Implement and optimize post-training techniques
Develop tools to measure model performance
Debug complex training pipeline issues
As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with

Job Summary

  • As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with.
  • You'll work at the intersection of cutting-edge research and production engineering, implementing, scaling, and improving post-training techniques like Constitutional AI, RLHF, and other alignment methodologies.
  • Anthropic's mission is to create reliable, interpretable, and steerable AI systems, aiming for AI to be safe and beneficial for users and society.

Matching Summary

As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with.

Salary

$350,000 — $500,000 USD

Skills & Requirements

Must-have

  • Implement and optimize post-training techniques
  • Develop tools to measure model performance
  • Debug complex training pipeline issues
  • Experience with large language models
  • Python programming proficiency
  • Distributed systems and HPC experience

Nice-to-have

  • Thrive in controlled chaos
  • Adapt quickly to changing priorities
  • Maintain clarity debugging complex issues
  • Balance research exploration with engineering rigor
  • Navigate ambiguity in fast-moving research

Key Requirements

  • Bachelor's degree or equivalent experience
  • Proficiency in Python, deep learning frameworks, and distributed computing

Work Rights

Not specified

Sponsorship: available

Tailored Resume

Cover Letter