NVIDIA is seeking a Senior ML Evaluation Engineer to join their Autonomous Vehicles Evaluation team, focusing on developing learned evaluation systems for driving behavior. The ideal candidate will possess extensive experience in machine learning, particularly with LLMs and VLMs, and have a strong software engineering background
Job Summary
Design and build learned evaluation pipelines that assess driving behavior using LLMs, VLMs, and multimodal models.
Define evaluation-of-evaluation methodology — how do we know our learned evaluators are correct?
Instrument evaluation systems with robust experiment tracking, A/B comparison tooling, and model versioning.
Matching Summary
Match Score: 85
NVIDIA is seeking a Senior ML Evaluation Engineer to join their Autonomous Vehicles Evaluation team, focusing on developing learned evaluation systems for driving behavior. The ideal candidate will possess extensive experience in machine learning, particularly with LLMs and VLMs, and have a strong software engineering background.
Salary
Base: 184,000 USD - 356,500 USD; Bonus/Equity: Not specified; Benefits: Eligible for equity and benefits
Skills & Requirements
Must-have
LLM/VLM-based pipelines
Agentic workflows
Learned evaluation methodology
Large-scale data processing
Python and C++ software engineering
Nice-to-have
Autonomous driving domain experience
Driving behavior taxonomies
Video understanding models
Agentic AI frameworks
Key Requirements
PhD with 4+ years, MS with 6+ years, or BS with 8+ years experience