Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, CA, US
On-site

Job Summary

  • The role involves building, profiling, and optimizing the training and inference framework for next-gen agent RL platforms.
  • Candidates will post-train state-of-the-art models to define stable recipes for enterprise engagements ranging from cybersecurity to healthtech.
  • Scale offers a competitive compensation package including base salary, equity, comprehensive health benefits, and a learning stipend.

Salary

Base: $250,000 - $350,000 USD; Equity: Included based on Board approval; Benefits: Health, dental, vision, retirement, PTO, learning stipend

Skills & Requirements

Must-have

  • 1-3 years LLM training in production
  • Experience with RLHF/RLVR algorithms
  • Multi-node LLM training and inference
  • Proficiency in CUDA, PyTorch, and Transformers
  • GPU cluster architecture and operation

Nice-to-have

  • Passion for system optimization
  • Strong cross-functional communication skills
  • Integrating state-of-the-art research into production systems

Key Requirements

  • PhD or Master's in Computer Science
  • 1-3 years of LLM training experience
  • Strong software engineering skills

Work Rights

Not specified