Senior Genai Algorithms Engineer — Post-training Optimizations

NVIDIA

Base: $152,000 - $241,500 (level 3) or $184,000 - ...
**
5+ years deep learning experience
Python and pytorch proficiency
Model optimization techniques
** NVIDIA is seeking a Senior GenAI Algorithms Engineer to enhance generative AI models through model optimization techniques for improved inference efficiency. The role demands expertise in deep learning, software design, and collaboration within NVIDIA's AI ecosystem, with a focus on algorithm development and hardware integration. **

Job Summary

  • This role focuses on optimizing generative AI models like LLMs and diffusion models for maximal inference efficiency using advanced techniques such as quantization and sparsity.
  • Candidates will design modular software platforms and integrate innovative optimization algorithms into NVIDIA's ecosystem including TensorRT-LLM and Megatron-LM.
  • The position offers competitive compensation ranging from $152,000 to $287,500 based on level, along with equity and comprehensive benefits.

Matching Summary

Match Score: 75

** NVIDIA is seeking a Senior GenAI Algorithms Engineer to enhance generative AI models through model optimization techniques for improved inference efficiency. The role demands expertise in deep learning, software design, and collaboration within NVIDIA's AI ecosystem, with a focus on algorithm development and hardware integration. **

Salary

Base: $152,000 - $241,500 (Level 3) or $184,000 - $287,500 (Level 4); Bonus/Equity: Eligible for equity; Benefits: Comprehensive benefits package included

Skills & Requirements

Must-have

  • 5+ years deep learning experience
  • Python and PyTorch proficiency
  • Model optimization techniques
  • GPU architecture knowledge
  • Software systems design skills

Nice-to-have

  • Contributions to open-source ML frameworks
  • Experience with large-scale GPU clusters
  • CUDA and Triton kernel development
  • Strong communication in fast-paced environment
  • Familiarity with NVIDIA NeMo and TensorRT

Key Requirements

  • Master's or PhD in Computer Science or related field
  • 5+ years relevant work or research experience
  • Strong foundation in algorithms and programming fundamentals

Work Rights

Not specified

Tailored Resume

Cover Letter