Product Manager, Health Automation And Resilience

Invidia

Multiple Locations
Base: 168,000 usd - 258,750 usd for level 4; 208,0...
Hybrid
Experience with cloud infrastructure
Strong architectural understanding of telemetry systems
Ability to craft clear product requirements
NVIDIA DGX Cloud is seeking a technical Product Manager for Health Automation and Resilience efforts in AI infrastructure

Job Summary

  • NVIDIA DGX Cloud is seeking a technical Product Manager for Health Automation and Resilience efforts in AI infrastructure.
  • The role involves developing products for fault detection and automated repair workflows to enhance GPU fleet performance.
  • Candidates will collaborate with engineering teams to improve cloud provider efficiency and end-user experience at scale.

Matching Summary

NVIDIA DGX Cloud is seeking a technical Product Manager for Health Automation and Resilience efforts in AI infrastructure.

Salary

Base: 168,000 USD - 258,750 USD for Level 4; 208,000 USD - 327,750 USD for Level 5; Benefits: Not specified

Skills & Requirements

Must-have

  • Experience with cloud infrastructure
  • Strong architectural understanding of telemetry systems
  • Ability to craft clear product requirements

Nice-to-have

  • Experience with GPU-accelerated compute
  • Knowledge of Kubernetes operators
  • Contributions to open-source communities

Key Requirements

  • Bachelor’s degree in Computer Science or Engineering
  • 8+ years of relevant experience
  • Experience in reliability engineering

Work Rights

Not specified

Tailored Resume

Cover Letter