Engineering Manager, Model Serving

Togetherai

San Francisco, CA, United States
$250,000 - $300,000; equity; health insurance + ot...
On-site
Ml inference serving frameworks
Kubernetes multi-cluster orchestration
Multi-tenant saas platforms
The primary focus will be on delivering world-class inference and fine-tuning in our public APIs and customer deployments by building automation and operations processes

Job Summary

  • The primary focus will be on delivering world-class inference and fine-tuning in our public APIs and customer deployments by building automation and operations processes.
  • You will be in charge of designing and scaling our ML processes & tooling at production scale – optimizing operations to ensure availability and reliability for our services.
  • We offer competitive compensation, startup equity, health insurance and other competitive benefits.

Matching Summary

The primary focus will be on delivering world-class inference and fine-tuning in our public APIs and customer deployments by building automation and operations processes.

Salary

$250,000 - $300,000; equity; health insurance and other competitive benefits

Skills & Requirements

Must-have

  • ML inference serving frameworks
  • Kubernetes multi-cluster orchestration
  • multi-tenant SaaS platforms
  • LLM inference serving systems
  • availability and performance SLAs
  • incident response and postmortems

Nice-to-have

  • building internal developer platforms
  • cost optimization for GPU infrastructure
  • contributions to open-source ML projects

Key Requirements

  • 5+ years operating production ML systems
  • 2+ years senior IC or tech lead
  • SLA ownership with specific metrics
  • Customer escalation and incident communication

Work Rights

Not specified

Tailored Resume

Cover Letter