Senior Deep Learning Inference Performance Architect

NVIDIA

Base: 184,000 usd - 356,500 usd; bonus/equity: equ...
Gpu architecture details
Ai inference workloads
Performance optimized low level code
Develop innovative GPU and system architectures to extend the state of the art in AI Inference performance and efficiency

Job Summary

  • Develop innovative GPU and system architectures to extend the state of the art in AI Inference performance and efficiency.
  • Write efficient software for AI Inference, including CUDA kernels, framework level code, and application level code.
  • NVIDIA is widely considered to be one of the technology world’s most desirable employers.

Matching Summary

Develop innovative GPU and system architectures to extend the state of the art in AI Inference performance and efficiency.

Salary

Base: 184,000 USD - 356,500 USD; Bonus/Equity: Equity and benefits; Benefits: Benefits

Skills & Requirements

Must-have

  • GPU architecture details
  • AI Inference workloads
  • performance optimized low level code
  • CUDA kernel development
  • deep learning software optimization

Nice-to-have

  • creative engineer
  • autonomous and love a challenge
  • forward-thinking and hard working people

Key Requirements

  • MS or PhD in CS, EE, Math or equivalent experience
  • 5+ years of relevant experience
  • Strong mathematical foundation in machine learning and deep learning
  • Expert programming skills in C, C++, and Python
  • Familiarity with GPU computing (CUDA or similar) and HPC (MPI, OpenMP)
  • Strong knowledge in computer architecture

Work Rights

Not specified

Tailored Resume

Cover Letter