Senior Performance Engineer - LLM Inference Frameworks

NVIDIA Corporation

Hybrid
NVIDIA is seeking a Senior Performance Engineer to enhance its large language model inference frameworks. The ideal candidate will have extensive experience in software development, particularly with deep learning frameworks, and a strong background in performance optimization.

Job Summary

  • NVIDIA is hiring exceptional software engineers to build and optimize core inference infrastructure for large language models.
  • Your work will directly shape frameworks behind state-of-the-art LLM inference used across NVIDIA and the AI community.
  • Join us to redefine what 'fast' means for LLM inference, building frameworks that power the next generation of generative AI at scale.

Skills & Requirements

Must-have

  • High-performance inference pipelines
  • Deep learning frameworks experience
  • Excellent Python programming skills

Nice-to-have

  • Hands-on experience with NVIDIA tools
  • Expertise in performance modeling
  • Strong grasp of inference efficiency trade-offs

Key Requirements

  • Bachelor's or higher degree in a relevant field
  • 5+ years of relevant software development experience
  • Experience profiling and debugging performance issues

Work Rights

Not specified
