The Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused, playing a central role in ensuring models meet safety and policy standards
Job Summary
The Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused, playing a central role in ensuring models meet safety and policy standards.
This role requires partnering closely with policy experts, Safeguards engineering teams, and other stakeholders to ensure evaluations are comprehensive and findings translate into meaningful improvements to model behavior.
Anthropic values impact and advancing long-term goals of steerable, trustworthy AI, viewing AI research as an empirical science and fostering a highly collaborative environment.
Matching Summary
The Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused, playing a central role in ensuring models meet safety and policy standards.
Salary
$230,000 - $270,000 USD
Skills & Requirements
Must-have
Trust and safety operations
Policy enforcement experience
Process building from scratch
Cross-functional coordination
Ambiguity navigation
Detail-oriented analysis
Nice-to-have
AI-assisted workflows
Sensitive content handling
High-stakes timeline experience
Technical toolkit expansion
Key Requirements
Bachelor's degree or equivalent experience
Relevant field of study
Minimum years of experience correlating to job level