Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, CA, US
Base: $250,000 - $350,000 USD; Equity: included ba...
On-site
5+ years LLM training in production
Experience with RLHF/RLVR algorithms
Knowledge of PPO/GRPO algorithms

Job Summary

  • The role involves building a next-gen Agent RL training platform to train best-in-class agents for real enterprise use-cases.
  • Candidates will integrate cutting-edge research into the training stack so that complex multi-agent systems can learn from process and outcome rewards.
  • Scale offers competitive compensation including base salary, equity, comprehensive health benefits, and a learning stipend.

Matching Summary

The role involves building a next-gen Agent RL training platform to train best-in-class agents for real enterprise use-cases.

Salary

Base: $250,000 - $350,000 USD; Equity: Included based on Board approval; Benefits: Comprehensive health, dental, vision, retirement, learning stipend, and PTO

Skills & Requirements

Must-have

  • 5+ years LLM training in production
  • Experience with RLHF/RLVR algorithms
  • Knowledge of PPO/GRPO algorithms
  • Ability to design solutions for multi-agent systems

Nice-to-have

  • Publications in NeurIPS, ICLR, or ICML
  • PhD or Master's in Computer Science
  • Experience deploying to enterprise customers

Key Requirements

  • 5+ years of LLM training experience
  • PhD or Master's in Computer Science
  • Recent publications in top AI conferences

Work Rights

Not specified
