Base: $184,000 - $356,500 usd depending on level; ...
Hybrid
Deep learning and neural network training expertise
Gpu architecture fundamentals and computer architecture
C++, python, and cuda programming proficiency
NVIDIA is seeking a Senior High-Performance LLM Training Engineer with expertise in performance analysis and optimization for large language model (LLM) training workloads. The role involves working on NVIDIA's software stack in frameworks like PyTorch and JAX, enhancing training efficiency on GPUs, and shaping future hardware roadmaps
Job Summary
This role focuses on optimizing NVIDIA's high-performance LLM software stack in frameworks like PyTorch and JAX for training on thousands of GPUs.
Candidates will implement production-quality software across layers of the deep learning platform stack from drivers to DL frameworks.
NVIDIA offers highly competitive salaries, equity, and a comprehensive benefits package for this hybrid role.
Matching Summary
Match Score: 85
NVIDIA is seeking a Senior High-Performance LLM Training Engineer with expertise in performance analysis and optimization for large language model (LLM) training workloads. The role involves working on NVIDIA's software stack in frameworks like PyTorch and JAX, enhancing training efficiency on GPUs, and shaping future hardware roadmaps.
Salary
Base: $184,000 - $356,500 USD depending on level; Equity: Eligible; Benefits: Comprehensive package included
Skills & Requirements
Must-have
Deep learning and neural network training expertise
GPU architecture fundamentals and computer architecture
C++, Python, and CUDA programming proficiency
Application performance analysis and tuning experience
Production-quality software implementation skills
Nice-to-have
Experience with proprietary processor system simulators
Background in building automated workload analysis tools
Collaboration with major cloud service providers
Interest in shaping future GPU hardware roadmaps
Key Requirements
PhD in Computer Science or related field with 5+ years experience
MS degree with 8+ years of meaningful work experience
Proven experience analyzing and tuning application performance