Senior Ai Performance And Efficiency Engineer

Invidia

Multiple Locations
Base: 152,000 usd - 241,500 usd for level 3, 184,0...
Large scale compute infrastructure
Ml training and inference optimization
Nsight systems and nsight compute debugging
You will collaborate closely with AI/ML researchers to improve model efficiency, leading to productivity improvements and cost savings

Job Summary

  • You will collaborate closely with AI/ML researchers to improve model efficiency, leading to productivity improvements and cost savings.
  • The role involves building tools and frameworks to detect and analyze efficiency bottlenecks and delivering scalable solutions across hardware, software, and infrastructure.
  • NVIDIA offers competitive salaries, equity, benefits, and a commitment to diversity and inclusion in a rapidly growing engineering team.

Matching Summary

You will collaborate closely with AI/ML researchers to improve model efficiency, leading to productivity improvements and cost savings.

Salary

Base: 152,000 USD - 241,500 USD for Level 3, 184,000 USD - 287,500 USD for Level 4; Bonus/Equity: Eligible for equity; Benefits: Comprehensive benefits package

Skills & Requirements

Must-have

  • Large scale compute infrastructure
  • ML training and inference optimization
  • NSight Systems and NSight Compute debugging
  • Distributed training with NCCL
  • Python, Go, Bash programming
  • Cloud computing platforms experience

Nice-to-have

  • NVIDIA GPUs and CUDA programming
  • MLPerf benchmarking
  • InfiniBand with IBOP and RDMA
  • Distributed storage systems like Lustre and GPFS
  • Deep learning frameworks PyTorch and TensorFlow
  • Excellent communication and collaboration skills

Key Requirements

  • BS or equivalent in Computer Science or related field
  • Minimum 5+ years experience in large scale compute infrastructure
  • Experience with ML training and inference performance optimization
  • Proficiency in Python, Go, Bash
  • Familiarity with cloud platforms AWS, GCP, Azure
  • Dedication to ongoing learning in AI/ML infrastructure

Work Rights

Not specified

Tailored Resume

Cover Letter