Infrastructure Operations Manager

Nscaleoperationsukltd

Us
$60,000—$140,000 usd py
On-site
Ai data center operations
Hpc and gpu systems management
24/7 operational reliability
Nscale is seeking an Infrastructure Operations Manager to oversee their AI Data Center, ensuring operational excellence and managing a team of engineers and technicians. The ideal candidate will have substantial experience in managing data centers, particularly in HPC and GPU environments, along with strong leadership skills

Job Summary

  • Oversee all devices, systems, and infrastructure at the data center, ensuring 24/7 operational reliability and optimal performance for high-performance AI workloads.
  • Lead and mentor a team of engineers, technicians, and support staff, providing training and managing shift schedules to guarantee continuous coverage.
  • Serve as the primary POC for clients, providing regular reporting on site SLA and KPI’s, and collaborate with vendors and contractors to manage procurement, repairs, and upgrades.

Matching Summary

Match Score: 85

Nscale is seeking an Infrastructure Operations Manager to oversee their AI Data Center, ensuring operational excellence and managing a team of engineers and technicians. The ideal candidate will have substantial experience in managing data centers, particularly in HPC and GPU environments, along with strong leadership skills.

Salary

$60,000—$140,000 USD

Skills & Requirements

Must-have

  • AI Data Center Operations
  • HPC and GPU Systems Management
  • 24/7 Operational Reliability
  • Power, Cooling, and Environmental Monitoring
  • Team Leadership and Development
  • Client SLA and KPI Reporting

Nice-to-have

  • NVIDIA GPUs and AI Frameworks
  • Hybrid Cloud/HPC Environments
  • Continuous Learning and Innovation

Key Requirements

  • Bachelor's degree in Computer Science, Engineering, or related field
  • 5+ years of data center management experience
  • Experience managing HPC and GPU environments
  • Proven leadership skills
  • Strong understanding of data center power, cooling, and environmental systems
  • Experience with inventory and spare parts management
  • On-site presence required with occasional travel
  • Participate in on-call rotation

Work Rights

Not specified

Tailored Resume

Cover Letter