Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.
Job Summary
As a Safeguards Analyst, you will build and execute enforcement workflows to detect and mitigate the use of AI products for human exploitation and abuse.
This role involves designing automated systems, partnering with engineering teams, conducting deep-dive investigations, and collaborating with external intelligence partners.
Salary
$245,000 to $285,000 USD
Skills & Requirements
Must-have
human exploitation and abuse detection
enforcement workflows
SQL and data analysis
content moderation
counter-exploitation work
sensitive content review
Nice-to-have
AI safety interest
generative AI products
threat actor profiling
external intelligence partners
Key Requirements
3+ years of trust and safety experience
Subject matter expertise in human trafficking/exploitation
Experience building detection/review workflows
Proficiency in SQL, Python, or data analysis tools