Senior Ai Infrastructure Engineer, Model Serving Platform

Scaleai

San Francisco, CA, US
Base: $216,000—$270,000 usd; equity: subject to bo...
On-site
High-performance backend systems
Llm serving and routing fundamentals
Containers and orchestration tools
Build and maintain fault-tolerant, high-performance systems for serving LLMs workloads at scale

Job Summary

  • Build and maintain fault-tolerant, high-performance systems for serving LLMs workloads at scale.
  • Collaborate with researchers and engineers to integrate and optimize models for production and research use cases.
  • Compensation packages include base salary, equity, and benefits.

Matching Summary

Build and maintain fault-tolerant, high-performance systems for serving LLMs workloads at scale.

Salary

Base: $216,000—$270,000 USD; Equity: Subject to Board of Director approval; Benefits: Comprehensive health, dental and vision coverage, retirement benefits, learning and development stipend, generous PTO, commuter stipend.

Skills & Requirements

Must-have

  • high-performance backend systems
  • LLM serving and routing fundamentals
  • containers and orchestration tools
  • cloud infrastructure
  • infrastructure as code

Nice-to-have

  • modern LLM serving frameworks
  • ML fundamentals
  • backend system design expertise
  • collaborative environment
  • fast-moving environments

Key Requirements

  • 5+ years of experience
  • Strong programming skills
  • Experience with LLM capabilities
  • Proven ability to solve complex problems

Work Rights

Not specified

Tailored Resume

Cover Letter