Manager, Next-gen Ai Cluster Validation

Nvidia Corporation

Base: 224,000 usd - 356,500 usd; bonus/equity: eli...
Fully remote
8+ years experience in hpc or ml fields
3+ years technical leadership experience
Proficiency in go, python, or ansible
This leader will play a crucial role in the development and validation of AI computing systems at scale

Job Summary

  • This leader will play a crucial role in the development and validation of AI computing systems at scale.
  • The role involves collaborating with top experts to invent new supercomputing architectures for machine learning and HPC.
  • NVIDIA offers highly competitive salaries ranging from 224,000 USD to 356,500 USD along with equity and comprehensive benefits.

Matching Summary

This leader will play a crucial role in the development and validation of AI computing systems at scale.

Salary

Base: 224,000 USD - 356,500 USD; Bonus/Equity: Eligible for equity; Benefits: Comprehensive benefits package included

Skills & Requirements

Must-have

  • 8+ years experience in HPC or ML fields
  • 3+ years technical leadership experience
  • Proficiency in Go, Python, or Ansible
  • Experience with distributed engineering teams
  • BS in Applied Science or Engineering

Nice-to-have

  • Experience leading HPC research environments
  • Knowledge of multi-GPU training workloads
  • Expertise in InfiniBand and RoCE networking
  • Familiarity with Prometheus and Grafana
  • Track record of growing diverse teams

Key Requirements

  • BS (Masters or PhD preferred) in Applied Science or Engineering
  • 8+ overall years experience in high-performance computing or machine learning
  • 3+ years of technical leadership experience

Work Rights

Not specified

Tailored Resume

Cover Letter