Senior Platform and EngOps Engineer - Cluster Operations

NVIDIA

Job Summary

  • NVIDIA is at the forefront of AI and High-Performance Computing innovations.
  • The role involves developing automated tools for GPU cluster management.
  • Candidates will collaborate with dynamic teams across multiple time zones.

Salary

Base: 176,000 USD - 276,000 USD for Level 4; 208,000 USD - 333,500 USD for Level 5; Benefits: Not specified

Skills & Requirements

Must-have

  • Automation expertise in Ansible, Python, Shell Scripting
  • Experience managing large GPU clusters
  • Proficient with Linux fundamentals

Nice-to-have

  • Familiarity with resource scheduling managers
  • Experience with industry standard alerting tools
  • Hands-on experience with GPU-focused hardware

Key Requirements

  • BS or MS in relevant field or equivalent experience
  • 8+ years of hands-on experience in cluster administration

Work Rights

Not specified
