Senior High-performance Llm Training Engineer

NVIDIA

Base: $184,000 - $356,500 usd depending on level; ...
Hybrid
Deep learning and neural network training expertise
Gpu architecture fundamentals and computer architecture
C++, python, and cuda programming proficiency
NVIDIA is seeking a Senior High-Performance LLM Training Engineer with expertise in performance analysis and optimization for large language model (LLM) training workloads. The role involves working on NVIDIA's software stack in frameworks like PyTorch and JAX, enhancing training efficiency on GPUs, and shaping future hardware roadmaps

Job Summary

  • This role focuses on optimizing NVIDIA's high-performance LLM software stack in frameworks like PyTorch and JAX for training on thousands of GPUs.
  • Candidates will implement production-quality software across layers of the deep learning platform stack from drivers to DL frameworks.
  • NVIDIA offers highly competitive salaries, equity, and a comprehensive benefits package for this hybrid role.

Matching Summary

Match Score: 85

NVIDIA is seeking a Senior High-Performance LLM Training Engineer with expertise in performance analysis and optimization for large language model (LLM) training workloads. The role involves working on NVIDIA's software stack in frameworks like PyTorch and JAX, enhancing training efficiency on GPUs, and shaping future hardware roadmaps.

Salary

Base: $184,000 - $356,500 USD depending on level; Equity: Eligible; Benefits: Comprehensive package included

Skills & Requirements

Must-have

  • Deep learning and neural network training expertise
  • GPU architecture fundamentals and computer architecture
  • C++, Python, and CUDA programming proficiency
  • Application performance analysis and tuning experience
  • Production-quality software implementation skills

Nice-to-have

  • Experience with proprietary processor system simulators
  • Background in building automated workload analysis tools
  • Collaboration with major cloud service providers
  • Interest in shaping future GPU hardware roadmaps

Key Requirements

  • PhD in Computer Science or related field with 5+ years experience
  • MS degree with 8+ years of meaningful work experience
  • Proven experience analyzing and tuning application performance

Work Rights

Not specified

Tailored Resume

Cover Letter