Senior Genai Algorithms Engineer — Model Optimizations For Inference

Invidia

Multiple Locations
Base: 152,000 usd - 287,500 usd depending on level...
Deep learning model optimization
Python and pytorch proficiency
Gpu kernel development with cuda or triton
NVIDIA is at the forefront of the generative AI revolution, focusing on optimizing AI models for maximal inference efficiency

Job Summary

  • NVIDIA is at the forefront of the generative AI revolution, focusing on optimizing AI models for maximal inference efficiency.
  • This role offers a unique opportunity to work at the intersection of research and engineering, pushing the boundaries of large-scale AI optimization.
  • NVIDIA offers highly competitive salaries and a comprehensive benefits package in a diverse and inclusive work environment.

Matching Summary

NVIDIA is at the forefront of the generative AI revolution, focusing on optimizing AI models for maximal inference efficiency.

Salary

Base: 152,000 USD - 287,500 USD depending on level; Bonus/Equity: Eligible for equity; Benefits: Comprehensive benefits package

Skills & Requirements

Must-have

  • Deep learning model optimization
  • Python and PyTorch proficiency
  • GPU kernel development with CUDA or Triton
  • Experience with large language models
  • Software design and performance analysis

Nice-to-have

  • Contributions to ML frameworks
  • Experience with NVIDIA deep learning SDKs
  • Training generative AI on large GPU clusters
  • Familiarity with open-source inference frameworks
  • Strong communication and collaboration skills

Key Requirements

  • Master’s, PhD, or equivalent experience
  • 5+ years deep learning experience
  • Strong software design and debugging skills
  • Proficiency in modern ML frameworks
  • Ability to work independently and collaboratively

Work Rights

Not specified

Tailored Resume

Cover Letter