Base: 152,000 usd - 241,500 usd for level 3; 184,0...
C++ and python programming
Parallel systems programming on gpus
Deep learning model performance optimization
Our team builds optimizations directly into mainstream open source Deep Learning frameworks to boost performance across NVIDIA's AI stack
Job Summary
Our team builds optimizations directly into mainstream open source Deep Learning frameworks to boost performance across NVIDIA's AI stack.
You will build and support the Transformer Engine library to accelerate training of Large Language Models and collaborate on systems research to improve model performance.
The role includes engaging with the open-source community, supporting enterprise customers, and influencing the design of new hardware and software components.
Matching Summary
Our team builds optimizations directly into mainstream open source Deep Learning frameworks to boost performance across NVIDIA's AI stack.