Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, CA, US
Base: $250,000 - $350,000 USD; Equity: included ba...
On-site
5+ years of LLM training in production
Experience with RLHF/RLVR algorithms
Knowledge of PPO/GRPO algorithms
Job Summary
The role involves building a next-generation Agent RL training platform to train best-in-class agents for real enterprise use cases.
Candidates will integrate cutting-edge research into the training stack so that complex multi-agent systems can learn from both process and outcome rewards.
Scale offers competitive compensation including base salary, equity, comprehensive health benefits, and a learning stipend.
Salary
Base: $250,000 - $350,000 USD; Equity: Included based on Board approval; Benefits: Comprehensive health, dental, vision, retirement, learning stipend, and PTO