Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026

NVIDIA

Job Summary

  • Establish groundbreaking performance benchmarking methodologies and analysis workflows, and identify performance issues and opportunities for NVIDIA’s inference ecosystem.
  • Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT.
  • Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions.

Salary

  • Base (Level 2): 124,000 USD - 195,500 USD
  • Base (Level 3): 152,000 USD - 241,500 USD
  • Bonus/Equity: Not specified
  • Benefits: Not specified

Skills & Requirements

Must-have

  • Deep Learning Software Engineering
  • TensorRT performance optimization
  • GPU-accelerated deep learning
  • C++ and Python programming
  • DL frameworks and inference libraries

Nice-to-have

  • Architectural knowledge of GPUs
  • Modern deep learning models
  • CUDA/Triton programming
  • LLM inference framework contributions
  • Optimizing for low-latency systems

Key Requirements

  • Bachelor's, Master's, PhD, or equivalent experience
  • 2 years of relevant software development experience
  • Must have relevant work authorization

Work Rights

Not specified
