Inference Optimization Architect, Speech AI

NVIDIA

Pune, India

Job Summary

  • NVIDIA is an industry leader in AI and High-Performance Computing.
  • The role focuses on optimizing Speech AI models for millions of customers.
  • Join a team at the forefront of technological advancement.

Skills & Requirements

Must-have

  • Inference performance optimization
  • Model compression techniques
  • CUDA kernel development

Nice-to-have

  • Experience with embedded systems
  • Strong collaborative skills
  • Publications or contributions to open-source projects

Key Requirements

  • Master's or BE/BTech in Computer Science
  • 10+ years of total experience
  • 5+ years of experience in performance optimization

Work Rights

Not specified
