AI Inference Performance Engineer

NVIDIA

CA, United States
Hybrid

Job Summary

  • The team optimizes and benchmarks GenAI inference on NVIDIA's latest accelerators.
  • You will drive industry benchmark results and own the end-to-end optimization pipeline.
  • NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer.

Salary

Base: 152,000 USD - 241,500 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Deep learning frameworks expertise
  • Strong Python or C++ skills
  • Experience in performance optimization

Nice-to-have

  • Experience with LLM frameworks
  • Performance modeling and profiling
  • GPU programming experience

Key Requirements

  • BS, MS, or PhD in relevant fields
  • 5+ years of software development experience
  • Deep understanding of LLM/VLM architectures

Work Rights

Not specified
