Deep Learning Software Engineer, FlashInfer - New College Grad 2025

Nvidia Corporation

Job Summary

  • The role involves designing and building efficient attention kernel implementations and LLM inference runtime components.
  • Candidates will collaborate with teams across deep learning frameworks, libraries, and GPU architecture groups.
  • The position offers eligibility for equity and benefits alongside a competitive base salary range.

Salary

  • Base: $108,000 - $178,250 (Level 1) or $124,000 - $195,500 (Level 2)
  • Bonus/Equity: Eligible for equity
  • Benefits: Comprehensive benefits package included

Skills & Requirements

Must-have

  • Deep learning framework experience
  • Python and C/C++ programming skills
  • GPU kernel development expertise

Nice-to-have

  • Open source project contributions
  • Domain-specific compiler background
  • FlashInfer or FlashAttention knowledge

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • PhD preferred but not mandatory
  • Strong experience with PyTorch, JAX, TensorFlow, or ONNX

Work Rights

Not specified
