Machine Learning Engineer, Geforce G-assist

Invidia

Us, CA, United States
Base: 184,000 usd - 287,500 usd for level 4; 224,0...
Hybrid
C/c++ performance-critical coding
Python programming proficiency
Experience with llama.cpp or similar frameworks
At NVIDIA, we’re building GeForce G-Assist — an on-device AI assistant that combines Small Language Models, retrieval systems, and hybrid cloud capabilities to deliver responsive, context-aware assistance inside the GeForce ecosystem

Job Summary

  • At NVIDIA, we’re building GeForce G-Assist — an on-device AI assistant that combines Small Language Models, retrieval systems, and hybrid cloud capabilities to deliver responsive, context-aware assistance inside the GeForce ecosystem.
  • We value engineers who enjoy thinking across the full system—from model behavior to runtime performance—and collaborate closely with product and engineering teams to ensure reliability in real-world scenarios.
  • With competitive salaries and a generous benefits package, we are considered to be one of the technology world's most desirable employers, rapidly growing exclusive engineering teams due to unprecedented growth.

Matching Summary

At NVIDIA, we’re building GeForce G-Assist — an on-device AI assistant that combines Small Language Models, retrieval systems, and hybrid cloud capabilities to deliver responsive, context-aware assistance inside the GeForce ecosystem.

Salary

Base: 184,000 USD - 287,500 USD for Level 4; 224,000 USD - 356,500 USD for Level 5; Bonus/Equity: Eligible for equity; Benefits: Generous benefits package

Skills & Requirements

Must-have

  • C/C++ performance-critical coding
  • Python programming proficiency
  • Experience with llama.cpp or similar frameworks
  • Evaluation of Small Language Models
  • Knowledge of SLM and VLM architectures
  • Retrieval-augmented generation systems
  • Hybrid local and cloud AI architectures

Nice-to-have

  • Collaborative cross-team environment
  • Problem solving and learning mindset
  • Experience with agentic AI workflows
  • Multimodal model contributions
  • Translating user feedback into improvements

Key Requirements

  • 8+ years experience in system software or related field
  • M.S. or higher in Computer Science or related field or equivalent experience
  • Strong C/C++ and Python coding skills
  • Hands-on experience with local inference frameworks
  • Experience evaluating conversational AI models
  • Knowledge of retrieval technologies and agentic AI
  • Not specified work authorization requirements

Work Rights

Not specified

Tailored Resume

Cover Letter