Ai Researcher, Core Ml (turbo)

Together AI

San Francisco, United States
Base: $200,000 - $280,000; bonus/equity: startup e...
On-site
Large-scale inference systems expertise
Rl/post-training methods like grpo or dpo
Python coding and gpu performance profiling
The role focuses on pushing the frontier of efficient inference and RL-driven training to make models faster and cheaper at production scale

Job Summary

  • The role focuses on pushing the frontier of efficient inference and RL-driven training to make models faster and cheaper at production scale.
  • Candidates must be comfortable working across the stack from RL algorithms and training engines down to kernels and serving systems.
  • The position offers competitive compensation including a base salary range of $200,000 - $280,000 plus equity and benefits.

Matching Summary

The role focuses on pushing the frontier of efficient inference and RL-driven training to make models faster and cheaper at production scale.

Salary

Base: $200,000 - $280,000; Bonus/Equity: Startup equity included; Benefits: Health insurance and other competitive benefits

Skills & Requirements

Must-have

  • Large-scale inference systems expertise
  • RL/post-training methods like GRPO or DPO
  • Python coding and GPU performance profiling
  • Production-grade engine implementation skills
  • Distributed systems and HPC knowledge

Nice-to-have

  • Full-stack ownership across algorithms and systems
  • Experience with speculative decoding systems
  • Collaboration across infra, research, and product teams
  • Track record of impactful ML research papers
  • Interest in growing from spiky to full-stack

Key Requirements

  • 3+ years experience in ML systems or large-scale model training
  • Advanced degree in Computer Science, EE, or equivalent practical experience
  • Demonstrated experience owning complex technical projects end-to-end

Work Rights

Not specified

Tailored Resume

Cover Letter