Deep Learning Architect, Llm Inference - New College Grad 2026

Nvidia Corporation

Base: $124,000 - $195,500 (level 2) or $152,000 - ...
Not specified
Deep learning inference serving expertise
Pytorch programming and compiler optimizations
Gpu microarchitecture performance analysis
Nvidia is seeking a Deep Learning Architect focused on LLM Inference, ideal for new college graduates in 2026. The role involves optimizing inference server performance for large language models and developing innovative benchmarking methodologies, requiring advanced knowledge in deep learning and strong programming skills

Job Summary

  • The role focuses on optimizing inference server performance for Large Language Models to maintain NVIDIA's leadership in the generative AI revolution.
  • Candidates will characterize workloads of latest LLMs and collaborate with engineers to establish standard benchmarking methodologies.
  • The position offers competitive compensation including base salary ranging from $124,000 to $241,500 plus equity and benefits.

Matching Summary

Match Score: 85

Nvidia is seeking a Deep Learning Architect focused on LLM Inference, ideal for new college graduates in 2026. The role involves optimizing inference server performance for large language models and developing innovative benchmarking methodologies, requiring advanced knowledge in deep learning and strong programming skills.

Salary

Base: $124,000 - $195,500 (Level 2) or $152,000 - $241,500 (Level 3); Bonus/Equity: Eligible for equity; Benefits: Comprehensive benefits package included

Skills & Requirements

Must-have

  • Deep learning inference serving expertise
  • PyTorch programming and compiler optimizations
  • GPU microarchitecture performance analysis
  • vLLM SGLang TRT-LLM framework knowledge
  • E2E profiling tool development

Nice-to-have

  • Experience with agentic AI tools usage
  • Database and visualization tool proficiency
  • Novel use cases for workplace automation
  • Strong written communication skills
  • Collaboration with AI startup companies

Key Requirements

  • Master's or PhD in Computer Science or related field
  • Relevant software development experience required
  • Solid understanding of CPU and GPU microarchitecture

Work Rights

Not specified

Tailored Resume

Cover Letter