Hybrid (expected to report to the sunnyvale technical center at least three times per week)
8+ years industry experience in ml systems
Expertise in go python or c++
Experience with ml inference frameworks like triton vllm
General Motors is seeking a Staff ML Engineer for their Inference Platform team in Sunnyvale, CA, focusing on building a robust machine learning infrastructure for AI applications. The ideal candidate should have extensive experience in machine learning systems, backend services, and cloud platforms, and will play a key role in enhancing the architecture and usability of ML inference services
Job Summary
The team owns the cloud-agnostic, reliable, and cost-efficient platform that powers GM's AI efforts including autonomous vehicles.
This role involves leading technical decision-making on model serving strategies, orchestration, caching, and auto-scaling mechanisms.
Successful candidates are eligible for relocation assistance and a company vehicle evaluation program upon completion of a motor vehicle report review.
Matching Summary
Match Score: 85
General Motors is seeking a Staff ML Engineer for their Inference Platform team in Sunnyvale, CA, focusing on building a robust machine learning infrastructure for AI applications. The ideal candidate should have extensive experience in machine learning systems, backend services, and cloud platforms, and will play a key role in enhancing the architecture and usability of ML inference services.
Salary
Base: $185,500 - $270,000; Bonus: Incentive pay based on performance; Benefits: Medical dental vision retirement savings tuition assistance vehicle discounts
Skills & Requirements
Must-have
8+ years industry experience in ML systems
Expertise in Go Python or C++
Experience with ML inference frameworks like Triton vLLM
Strong background in distributed systems design
Cloud platform experience GCP Azure AWS
Nice-to-have
Hands-on experience building ML infrastructure platforms
Familiarity with Ray framework and vLLM
Contributions to open-source ML serving frameworks
Experience designing APIs and clients for ML workflows
Knowledge of hardware acceleration and GPU optimizations
Key Requirements
8+ years of industry experience
Expertise in Go Python C++ or relevant languages
Proven ability to drive cross-functional initiatives