Job Summary
You will collaborate with research scientists, software engineers, and hardware specialists to bring cutting-edge AI models from prototype to production.
As NVIDIA makes inroads into the datacenter business, our team plays a central role in optimizing datacenter deployments and hardware design.
The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5, with eligibility for equity and benefits.
Salary
Base: 184,000 USD - 356,500 USD depending on level; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits
Skills & Requirements
Must-have
Deep learning model optimization
Large Language Models (LLMs)
Vision-Language Models (VLMs)
TensorRT and TensorRT-LLM frameworks
PyTorch or TensorFlow deployment
Python and C++ programming
Nice-to-have
Model serving frameworks experience
Collaboration with research and hardware teams
Creative and autonomous work style
Key Requirements
Master’s or PhD in Computer Science or related field
4+ years professional deep learning experience
Strong foundation in transformer architectures and attention mechanisms