Research Scientist, Safety Post Training

Scale

San Francisco, CA, US
On-site

Job Summary

  • Scale Labs is launching a new team focused on policy research to bridge the gap between AI research and global policymakers.
  • The role involves designing post-training pipelines to study how training choices affect model safety, robustness, and alignment properties.
  • Compensation includes a base salary of $216,000 to $270,000 USD, plus equity and comprehensive benefits.

Salary

Base: $216,000 - $270,000 USD; Equity: included, subject to Board approval; Benefits: comprehensive health, dental, vision, retirement, PTO, stipend

Skills & Requirements

Must-have

  • Experience with RLHF, DPO, and GRPO techniques
  • Track record of published ML research
  • Three years of experience with sophisticated ML problems
  • Strong written and verbal communication skills

Nice-to-have

  • Experience with mechanistic interpretability or probing
  • Familiarity with red-teaming and adversarial evaluation
  • Experience studying failure modes such as reward hacking or sycophancy

Key Requirements

  • At least three years of experience addressing sophisticated ML problems
  • Published research in machine learning, particularly generative AI
  • Commitment to promoting safe, secure, and trustworthy AI deployments

Work Rights

Not specified
