Senior HPC Site Reliability Engineer

NVIDIA

Multiple Locations

Job Summary

  • We are looking for architects to help us evolve the way our private compute cloud is architected and optimized.
  • Provide leadership in the design and implementation of our large-scale compute cloud.
  • Help with strategic challenges we encounter, including effective resource utilization and planning for multi-year growth.

Skills & Requirements

Must-have

  • large-scale compute infrastructure
  • job scheduler experience
  • script-writing skills

Nice-to-have

  • Linux certification
  • Kubernetes deployment experience
  • modern container networking skills

Key Requirements

  • B.Sc. in Computer Science or a related field
  • 8+ years of experience
  • knowledge of fast distributed storage solutions

Work Rights

Not specified