Inference Optimization Architect, Speech Ai

NVIDIA

Pune, India
Inference performance optimization
Model compression techniques
Cuda kernel development
NVIDIA is an industry leader in AI and high-performance computing

Job Summary

  • NVIDIA is an industry leader in AI and high-performance computing.
  • The role focuses on optimizing Speech AI models to enhance user experience.
  • Join a team dedicated to solving real-world conversational AI challenges.

Matching Summary

NVIDIA is an industry leader in AI and high-performance computing.

Skills & Requirements

Must-have

  • Inference performance optimization
  • Model compression techniques
  • CUDA kernel development

Nice-to-have

  • Experience with embedded systems
  • Strong collaborative skills
  • Contributions to open-source projects

Key Requirements

  • Masters or BE/BTech in Computer Science
  • 10+ years of total experience
  • 5+ years on performance optimizations

Work Rights

Not specified

Tailored Resume

Cover Letter