Manager, Next-Generation AI Cluster Architecture

NVIDIA Corporation

Fully remote
8+ years in high-performance computing or machine learning
3+ years technical leadership experience
Proficiency in Go, Python, or Ansible

Job Summary

  • This role involves leading a team to develop the next generation of NVIDIA AI supercomputing systems and GPU cluster architectures.
  • The successful candidate will author reference architectures that influence future supercomputing systems for AI and HPC both inside and outside NVIDIA.
  • The base salary ranges from 224,000 USD to 356,500 USD, with eligibility for equity and a comprehensive benefits package.

Matching Summary

This role involves leading a team to develop the next generation of NVIDIA AI supercomputing systems and GPU cluster architectures.

Salary

Base: 224,000 USD - 356,500 USD; Bonus/Equity: Eligible for equity; Benefits: Comprehensive benefits package included

Skills & Requirements

Must-have

  • 8+ years in high-performance computing or machine learning
  • 3+ years technical leadership experience
  • Proficiency in Go, Python, or Ansible
  • Experience with multi-GPU and multi-node training workloads
  • Expertise in InfiniBand and RoCE networking

Nice-to-have

  • Experience leading teams in research environments at scale
  • Knowledge of open-source monitoring technologies like Prometheus
  • Track record of empowering team members and idea sharing
  • Ability to work in a remote-friendly distributed environment

Key Requirements

  • BS in Applied Science or Engineering (Master's or PhD preferred)
  • 8+ years overall experience in HPC or ML fields
  • 3+ years of technical leadership experience

Work Rights

Not specified
