Anthropic Fellows Program — Reinforcement Learning

Anthropic

Remote, US
Base: 3,850 USD / 2,310 GBP / 4,300 CAD per week; ...
Fully remote or in-person (Berkeley, CA or London, UK)
Fluent in Python programming
Full-time research commitment
AI safety and beneficial AI
The Anthropic Fellows Program is seeking candidates for its Reinforcement Learning workstream, aimed at fostering AI research and engineering talent. Selected fellows will engage in full-time research for four months, supported by mentorship from Anthropic researchers and provided with funding and resources to produce a public output, such as a research paper.

Job Summary

  • The Anthropic Fellows Program fosters AI research and engineering talent by providing funding and mentorship to promising technical candidates.
  • Fellows will primarily use external infrastructure to work on an empirical project aligned with research priorities, with the goal of producing a public output such as a paper submission.
  • The program offers 4 months of full-time research, direct mentorship from Anthropic researchers, access to a shared workspace, and a weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD plus benefits.

Salary

Base: 3,850 USD / 2,310 GBP / 4,300 CAD per week; Bonus/Equity: Not specified; Benefits: Varies by country

Skills & Requirements

Must-have

  • Fluent in Python programming
  • Full-time research commitment
  • AI safety and beneficial AI
  • Experience with large-scale distributed systems
  • Experience training/fine-tuning LLMs

Nice-to-have

  • Strong software engineering skills
  • Ability to balance research and engineering rigor
  • Ability to collaborate across disciplines
  • Ability to analyze and debug model training

Key Requirements

  • Work authorization in the US, UK, or Canada
  • Available to work full-time for 4 months

Work Rights

Must have work authorization in the US, UK, or Canada
