Manager, Site Reliability Engineering

NVIDIA

Us, CA, United States
Base: 208,000 usd - 333,500 usd; bonus/equity: eli...
Hybrid
Site reliability engineering management
Large scale cloud infrastructure
Python and scripting programming
You will be leading the team of site reliability engineers responsible for automating maintenance of 10000+ hosts and providing support to customers towards debugging workflows

Job Summary

  • You will be leading the team of site reliability engineers responsible for automating maintenance of 10000+ hosts and providing support to customers towards debugging workflows.
  • NVIDIA is widely considered to be one of the technology world’s most desirable employers with some of the most brilliant and talented people in the world working for us.
  • Your base salary will be determined based on your location, experience, and the pay of employees in similar positions with eligibility for equity and benefits.

Matching Summary

You will be leading the team of site reliability engineers responsible for automating maintenance of 10000+ hosts and providing support to customers towards debugging workflows.

Salary

Base: 208,000 USD - 333,500 USD; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • Site reliability engineering management
  • Large scale cloud infrastructure
  • Python and scripting programming
  • Service level agreement maintenance
  • Debugging and problem solving skills
  • Agile process and methodologies

Nice-to-have

  • Experience managing small engineering teams
  • Data center operations experience
  • Algorithm design and optimization
  • Cross time zone collaboration
  • Continuous improvement focus

Key Requirements

  • 8+ years industry experience
  • 2+ years people management experience
  • BS/MS in Computer Science or equivalent
  • Experience maintaining large scale cloud infrastructure
  • Proven Agile delivery track record

Work Rights

Not specified

Tailored Resume

Cover Letter