Staff Machine Learning Engineer, Ai Serving

Reddit

Remote, US
Base: $253,300 - $354,600 usd; equity: eligible fo...
Remote
7+ years ml engineering experience
Kubernetes orchestration at scale
Python and go programming proficiency
The Machine Learning Platform team owns the infrastructure powering recommendations, content discovery, and user quantification across Reddit

Job Summary

  • The Machine Learning Platform team owns the infrastructure powering recommendations, content discovery, and user quantification across Reddit.
  • This role involves leading the end-to-end design of a highly available, low-latency GPU-based model serving system supporting millions of QPS.
  • Candidates will benefit from comprehensive healthcare, 401k matching, and generous paid parental leave in a remote-first environment.

Matching Summary

The Machine Learning Platform team owns the infrastructure powering recommendations, content discovery, and user quantification across Reddit.

Salary

Base: $253,300 - $354,600 USD; Equity: Eligible for restricted stock units; Benefits: Comprehensive healthcare, 401k match, and paid time off

Skills & Requirements

Must-have

  • 7+ years ML Engineering experience
  • Kubernetes orchestration at scale
  • Python and Go programming proficiency
  • AWS or Google Cloud infrastructure
  • GPU model serving system design

Nice-to-have

  • Experience with Triton and vLLM frameworks
  • Deep understanding of multi-cluster topology
  • Strong communication with non-technical stakeholders
  • E2E inference performance benchmarking
  • Real-time ML observability implementation

Key Requirements

  • 7+ years of experience in ML Engineering
  • Proficiency in Python and Go languages
  • Experience operating Kubernetes at scale
  • Deep knowledge of cloud-based ML technologies

Work Rights

Not specified

Tailored Resume

Cover Letter