Staff Ml Engineer, Inference Platform

General Motors

Sunnyvale, CA, US
Base: $185,500 - $270,000; bonus: incentive pyy ba...
Hybrid (expected to report to the sunnyvale technical center at least three times per week)
8+ years industry experience in ml systems
Expertise in go python or c++
Experience with ml inference frameworks like triton vllm
General Motors is seeking a Staff ML Engineer for their Inference Platform team in Sunnyvale, CA, focusing on building a robust machine learning infrastructure for AI applications. The ideal candidate should have extensive experience in machine learning systems, backend services, and cloud platforms, and will play a key role in enhancing the architecture and usability of ML inference services

Job Summary

  • The team owns the cloud-agnostic, reliable, and cost-efficient platform that powers GM's AI efforts including autonomous vehicles.
  • This role involves leading technical decision-making on model serving strategies, orchestration, caching, and auto-scaling mechanisms.
  • Successful candidates are eligible for relocation assistance and a company vehicle evaluation program upon completion of a motor vehicle report review.

Matching Summary

Match Score: 85

General Motors is seeking a Staff ML Engineer for their Inference Platform team in Sunnyvale, CA, focusing on building a robust machine learning infrastructure for AI applications. The ideal candidate should have extensive experience in machine learning systems, backend services, and cloud platforms, and will play a key role in enhancing the architecture and usability of ML inference services.

Salary

Base: $185,500 - $270,000; Bonus: Incentive pay based on performance; Benefits: Medical dental vision retirement savings tuition assistance vehicle discounts

Skills & Requirements

Must-have

  • 8+ years industry experience in ML systems
  • Expertise in Go Python or C++
  • Experience with ML inference frameworks like Triton vLLM
  • Strong background in distributed systems design
  • Cloud platform experience GCP Azure AWS

Nice-to-have

  • Hands-on experience building ML infrastructure platforms
  • Familiarity with Ray framework and vLLM
  • Contributions to open-source ML serving frameworks
  • Experience designing APIs and clients for ML workflows
  • Knowledge of hardware acceleration and GPU optimizations

Key Requirements

  • 8+ years of industry experience
  • Expertise in Go Python C++ or relevant languages
  • Proven ability to drive cross-functional initiatives

Work Rights

Not specified

Tailored Resume

Cover Letter