[expression Of Interest] Research Scientist / Engineer, Honesty

Anthropic

New York City, NY, USA
$350,000 - $500,000 usd py
On-site
Language model finetuning
Classifier training
Experimental design
Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society

Job Summary

  • Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.
  • Responsibilities include designing data curation pipelines, developing classifiers for hallucinations, creating honesty benchmarks, and implementing techniques to ground model outputs.
  • The company offers competitive compensation, generous vacation and parental leave, flexible working hours, and a collaborative research environment focused on high-impact AI safety.

Matching Summary

Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.

Salary

$350,000 - $500,000 USD

Skills & Requirements

Must-have

  • language model finetuning
  • classifier training
  • experimental design
  • statistical analysis
  • data curation pipelines
  • human feedback collection

Nice-to-have

  • AI safety research
  • factual knowledge bases
  • confidence estimation methods
  • RLHF for truthfulness
  • collaboration and communication

Key Requirements

  • MS/PhD in Computer Science, ML, or related field
  • Strong programming skills in Python
  • Industry experience with language models
  • Proficiency in experimental design
  • Experience in data science or dataset creation

Work Rights

Not specified

Sponsorship: available

Tailored Resume

Cover Letter