Tech Lead Manager - MLRE, ML Systems

Scale

San Francisco, CA, US
On-site

Job Summary

  • The role involves building and optimizing the platform to enable next-generation LLM training, inference, and data curation.
  • Candidates will collaborate closely with ML teams and researchers to accelerate research and development efforts.
  • Compensation includes base salary ranging from $264,800 to $331,000 USD plus equity and comprehensive benefits.

Salary

Base: $264,800 - $331,000 USD; Bonus/Equity: Equity grant subject to Board approval; Benefits: Comprehensive health, dental, vision, retirement, learning stipend, PTO

Skills & Requirements

Must-have

  • Experience with multi-node LLM training
  • Developing large-scale distributed ML systems
  • Proficiency in CUDA, PyTorch, Transformers, and FlashAttention
  • Strong software engineering skills
  • Experience with post-training methods like RLHF

Nice-to-have

  • Expertise in instruction tuning and tool use
  • Knowledge of reasoning agents and multimodal models
  • Passion for system optimization
  • Cross-functional communication skills

Key Requirements

  • Experience with PPO/GRPO algorithms
  • Strong written and verbal communication skills
  • Proficiency in state-of-the-art ML technologies

Work Rights

Not specified
