You will play a pivotal role in defining a modular, scalable platform to seamlessly bridge training and deployment workflows enabling tight integration with training frameworks such as Megatron and Nemo
Job Summary
You will play a pivotal role in defining a modular, scalable platform to seamlessly bridge training and deployment workflows enabling tight integration with training frameworks such as Megatron and Nemo.
Your work will span multiple layers of the DL deployment stack including high-level framework development, GPU kernel optimizations, and performance profiling to maintain NVIDIA's leadership in inference software solutions.
This is an exceptional opportunity to join NVIDIA’s model optimization group and help build real-time, cost-effective computing platforms in a rapidly growing field.
Matching Summary
You will play a pivotal role in defining a modular, scalable platform to seamlessly bridge training and deployment workflows enabling tight integration with training frameworks such as Megatron and Nemo.
Salary
Base: 224,000 USD - 356,500 USD; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits