Software Engineer, Machine Learning Inference - New College Grad 2026

Nvidia Corporation

Base: 108,000 usd - 195,500 usd; bonus/equity: not...
Hybrid
C++ programming
Python programming
Cuda development
Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs

Job Summary

  • Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs.
  • Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative AI models.
  • Collaborate with deep learning experts and GPU architects throughout the company to influence Hardware and Software design for inference.

Matching Summary

Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs.

Salary

Base: 108,000 USD - 195,500 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • C++ programming
  • Python programming
  • CUDA development
  • Deep Learning Frameworks
  • Compilers
  • System Software

Nice-to-have

  • Inference backends and compilers for GPUs
  • Machine Learning techniques
  • GPU programming with CUDA or OpenCL
  • LLM inference frameworks
  • Deep learning frameworks
  • Close-to-metal performance analysis

Key Requirements

  • BS, MS, PhD or equivalent experience
  • Software development experience on a large codebase
  • Strong proficiency in C++
  • Rust or Python programming languages
  • Excellent problem-solving skills
  • Strong communication skills

Work Rights

Not specified

Tailored Resume

Cover Letter