Senior Deep Learning Software Engineer, Tensorrt Performance

NVIDIA

Base: 152,000 usd - 287,500 usd; bonus/equity: equ...
Hybrid
Deep learning software engineering
Tensorrt performance optimization
Gpu-accelerated inference
Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem

Job Summary

  • Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem.
  • Develop new model pipelines for NVIDIA’s inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance.
  • Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions.

Matching Summary

Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem.

Salary

Base: 152,000 USD - 287,500 USD; Bonus/Equity: Equity; Benefits: Benefits

Skills & Requirements

Must-have

  • Deep Learning Software Engineering
  • TensorRT performance optimization
  • GPU-accelerated inference
  • C++ and Python programming
  • DL frameworks and inference libraries

Nice-to-have

  • GPU architectural knowledge
  • Modern deep learning models
  • Low-latency systems optimization
  • Embedded AI pipelines

Key Requirements

  • 3+ years software development experience
  • Bachelors, Masters, PhD, or equivalent experience
  • Experience with performance analysis and optimization

Work Rights

Not specified

Tailored Resume

Cover Letter