Research Engineer, Core ML

Together AI

San Francisco, United States
Base: $200,000 - $280,000; equity: startup equity ...
On-site

Job Summary

  • This role involves translating frontier RL algorithms and scheduling methods into production-grade systems that power the company's API.
  • The team focuses on unifying efficient inference with RL-driven training to make models faster, cheaper, and more capable at scale.
  • Candidates are expected to own critical systems end-to-end, modifying kernels, memory layouts, and scheduling logic to drive measurable improvements.

Salary

Base: $200,000 - $280,000; Equity: Startup equity included; Benefits: Health insurance and other competitive benefits

Skills & Requirements

Must-have

  • Python programming expertise
  • Large-scale inference systems
  • RL and post-training pipelines
  • GPU performance optimization
  • Distributed serving architecture

Nice-to-have

  • Bias toward implementation and shipping
  • Full-stack ownership mindset
  • Experience with speculative decoding
  • Collaboration across research and infra teams
  • Track record of impactful open-source projects

Key Requirements

  • 3+ years experience in ML systems or large-scale model training
  • Advanced degree in Computer Science, EE, or equivalent practical experience
  • Demonstrated experience owning complex technical projects end-to-end

Work Rights

Not specified
