Software Engineer, Machine Learning Inference - New College Grad 2026

NVIDIA

Base: 108,000 usd - 195,500 usd; bonus/equity: not...
Hybrid
C++ programming
Python programming
Cuda development
Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs

Job Summary

  • Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs.
  • Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative AI models.
  • Collaborate with deep learning experts and GPU architects throughout the company to influence Hardware and Software design for inference.

Matching Summary

Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs.

Salary

Base: 108,000 USD - 195,500 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • C++ programming
  • Python programming
  • CUDA development
  • Deep Learning Frameworks
  • System Software development

Nice-to-have

  • GPU inference backends
  • LLM inference frameworks
  • Close-to-metal performance analysis
  • Fast-paced collaborative environment

Key Requirements

  • BS, MS, PhD or equivalent experience
  • Software development experience on large codebase
  • Strong proficiency in C++
  • Rust or Python programming languages
  • Experience in developing Deep Learning Frameworks, Compilers, or System Software

Work Rights

Not specified

Tailored Resume

Cover Letter