Safeguards Enforcement Analyst, Safety Evaluations

Anthropic

Remote
$230,000 - $270,000 usd py
Remote
Trust and safety operations
Policy enforcement experience
Process building from scratch
The Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused, playing a central role in ensuring models meet safety and policy standards

Job Summary

  • The Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused, playing a central role in ensuring models meet safety and policy standards.
  • This role requires partnering closely with policy experts, Safeguards engineering teams, and other stakeholders to ensure evaluations are comprehensive and findings translate into meaningful improvements to model behavior.
  • Anthropic values impact and advancing long-term goals of steerable, trustworthy AI, viewing AI research as an empirical science and fostering a highly collaborative environment.

Matching Summary

The Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused, playing a central role in ensuring models meet safety and policy standards.

Salary

$230,000 - $270,000 USD

Skills & Requirements

Must-have

  • Trust and safety operations
  • Policy enforcement experience
  • Process building from scratch
  • Cross-functional coordination
  • Ambiguity navigation
  • Detail-oriented analysis

Nice-to-have

  • AI-assisted workflows
  • Sensitive content handling
  • High-stakes timeline experience
  • Technical toolkit expansion

Key Requirements

  • Bachelor's degree or equivalent experience
  • Relevant field of study
  • Minimum years of experience correlating to job level

Work Rights

Not specified

Sponsorship: available

Tailored Resume

Cover Letter