Senior Deep Learning Software Engineer, LLM Performance

NVIDIA Corporation

CA, United States
Base: 184,000 USD - 356,500 USD; bonus/equity: not...
On-site
LLM inference performance analysis
TensorRT LLM, vLLM, SGLang
CUDA kernel development
NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer focused on enhancing LLM performance. The role requires extensive software development experience, particularly in deep learning frameworks, to optimize and deploy models across various NVIDIA architectures.

Job Summary

  • Analyze and improve LLM inference performance, enabling breakthroughs in areas such as LLMs, generative AI, recommenders, and vision.
  • Implement LLM inference, serving, and deployment algorithms and optimizations using TensorRT LLM, vLLM, SGLang, Triton, and CUDA kernels.
  • Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton.

Matching Summary

Match Score: 85

Salary

Base: 184,000 USD - 356,500 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • LLM inference performance analysis
  • TensorRT LLM, vLLM, SGLang
  • CUDA kernel development
  • Python/C/C++ programming
  • Deep learning framework experience

Nice-to-have

  • Performance modeling and profiling
  • GPU architecture knowledge
  • High-performance application optimization

Key Requirements

  • Bachelor's, Master's, or PhD degree, or equivalent experience
  • At least 8 years of software development experience
  • Experience with PyTorch, JAX, or TensorFlow

Work Rights

Not specified
