Applied Research Engineer Category Computer/software Location San Francisco, California Job Type Full Time

hireVouch

San Francisco, California, US
Not specified; not specified; not specified
On-site
Phd or master's in computer science
3+ years ml experience
Python and pytorch proficiency
The role focuses on developing cutting-edge systems for Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO)

Job Summary

  • The role focuses on developing cutting-edge systems for Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO).
  • Candidates will design rigorous measurement systems to improve the quality of human-in-the-loop data for frontier AI models.
  • The company offers a high-impact environment where employees take expanded responsibilities quickly with career growth tied directly to contributions.

Matching Summary

The role focuses on developing cutting-edge systems for Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO).

Salary

Not specified; Not specified; Not specified

Skills & Requirements

Must-have

  • PhD or Master's in Computer Science
  • 3+ years ML experience
  • Python and PyTorch proficiency
  • RLHF and DPO expertise
  • Top-tier conference publications

Nice-to-have

  • High agency and ownership mindset
  • Rapid execution capability
  • Cross-functional collaboration skills
  • Interest in human-AI collaboration

Key Requirements

  • Ph.D. or Master's degree required
  • Minimum 3 years of complex ML challenge experience
  • Track record of publishing at NeurIPS, ICML, or similar conferences

Work Rights

Not specified

Tailored Resume

Cover Letter