Member Of Technical Staff - Post-training And Rl

xAI

Palo Alto, CA, United States
Base: $180,000 - $600,000 usd; bonus/equity: equit...
On-site
Post-training techniques
Reinforcement learning methods
Reward modeling expertise
xAI is seeking a Member of Technical Staff specializing in post-training and reinforcement learning to contribute to their mission of creating AI systems that accurately understand the universe. The ideal candidate will be hands-on, possess strong communication skills, and thrive in a meritocratic environment, with a focus on building effective AI models

Job Summary

  • The role focuses on solving critical post-training and reinforcement learning challenges including reward modeling and preference optimization.
  • xAI operates with a flat organizational structure where all employees are expected to be hands-on and contribute directly to the mission.
  • The compensation package includes a base salary ranging from $180,000 to $600,000 USD along with equity and comprehensive benefits.

Matching Summary

Match Score: 85

xAI is seeking a Member of Technical Staff specializing in post-training and reinforcement learning to contribute to their mission of creating AI systems that accurately understand the universe. The ideal candidate will be hands-on, possess strong communication skills, and thrive in a meritocratic environment, with a focus on building effective AI models.

Salary

Base: $180,000 - $600,000 USD; Bonus/Equity: Equity included in total rewards; Benefits: Medical, vision, dental, 401(k), disability, life insurance

Skills & Requirements

Must-have

  • Post-training techniques
  • Reinforcement learning methods
  • Reward modeling expertise
  • Preference optimization skills
  • RLHF or DPO experience

Nice-to-have

  • Power user of AI models
  • Meritocratic environment fit
  • Strong communication skills
  • Hands-on engineering mindset
  • Truth-seeking AI focus

Key Requirements

  • Obsession with building useful models through RL
  • Pride in work within meritocratic environments
  • Experience with training models used by millions (plus)

Work Rights

Not specified

Tailored Resume

Cover Letter