Safeguards Analyst, Human Exploitation & Abuse

Anthropic

Remote
$245,000—$285,000 usd py
Remote
Human exploitation and abuse detection
Enforcement workflows
Sql and data analysis
Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society

Job Summary

  • Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.
  • As a Safeguards Analyst, you will build and execute enforcement workflows to detect and mitigate the use of AI products for human exploitation and abuse.
  • This role involves designing automated systems, partnering with engineering teams, conducting deep-dive investigations, and collaborating with external intelligence partners.

Matching Summary

Anthropic's mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.

Salary

$245,000—$285,000 USD

Skills & Requirements

Must-have

  • human exploitation and abuse detection
  • enforcement workflows
  • SQL and data analysis
  • content moderation
  • counter-exploitation work
  • sensitive content review

Nice-to-have

  • AI safety interest
  • generative AI products
  • threat actor profiling
  • external intelligence partners

Key Requirements

  • 3+ years trust and safety experience
  • Subject matter expertise in human trafficking/exploitation
  • Experience building detection/review workflows
  • Proficiency in SQL, Python, or data analysis tools
  • Ability to analyze complex situations
  • Sound judgment on content
  • Strong attention to detail

Work Rights

Not specified

Sponsorship: available

Tailored Resume

Cover Letter