Manager, Next-Gen AI Cluster Validation

Jobgether

United States
On-site

Job Summary

  • Lead the development and validation of next-generation AI supercomputing systems at scale.
  • Manage a high-performing technical team responsible for integrating compute, networking, storage, and software systems into large-scale AI and HPC clusters.
  • Design and implement tools, processes, and documentation to support cluster development, automation, and performance engineering.

Matching Summary

Lead the development and validation of next-generation AI supercomputing systems at scale.

Skills & Requirements

Must-have

  • AI supercomputing systems at scale
  • integrating compute, networking, storage, and software
  • large-scale AI and HPC clusters
  • design and implement tools and processes
  • automation and performance engineering

Nice-to-have

  • strategic leadership with hands-on execution
  • fast-paced, remote-friendly environment
  • technical leader passionate about AI
  • HPC and supercomputing innovation

Key Requirements

  • Managerial experience
  • Technical leadership experience
  • Experience with AI/HPC clusters

Work Rights

Not specified
