Distinguished Engineer, Cloud Site Reliability Engineering

NVIDIA

Base: 320,000 usd - 488,750 usd; bonus/equity: eli...
Hybrid
Cloud infrastructure maintenance
Highly available production environment
Distributed systems and rest apis
Serve as an SRE Architect for the GPU Private Cloud team, supporting thousands of NVIDIANs globally for interactive development, CI/CD, and QA testing

Job Summary

  • Serve as an SRE Architect for the GPU Private Cloud team, supporting thousands of NVIDIANs globally for interactive development, CI/CD, and QA testing.
  • Evaluate, identify, and develop software solutions to optimize critical software development workflows and identify performance bottlenecks to improve speed and cost efficiency of AI development and testing systems.
  • Lead software development projects, technically direct a team of engineers, and guide them to provide efficient and impactful solutions while looking for and resolving issues within software systems.

Matching Summary

Serve as an SRE Architect for the GPU Private Cloud team, supporting thousands of NVIDIANs globally for interactive development, CI/CD, and QA testing.

Salary

Base: 320,000 USD - 488,750 USD; Bonus/Equity: eligible for equity; Benefits: eligible for benefits

Skills & Requirements

Must-have

  • Cloud infrastructure maintenance
  • Highly available production environment
  • Distributed systems and REST APIs
  • Docker containers and Virtual Machines
  • Cloud technologies (OpenStack, Kubernetes, Chef/Puppet, Hadoop/Ceph/SwiftStack, LXC, Git, Perforce, JFrog, Kafka)
  • Java, Python, Shell-script programming

Nice-to-have

  • Depth in AI/ML/DL algorithms
  • Collaborative and interpersonal skills
  • Guiding and influencing others
  • Large-scale software systems development
  • Hardware cost optimization

Key Requirements

  • 18+ years systems software development
  • 1+ year dedicated to AI development/exploration
  • BS EE/CS or equivalent experience
  • Experience maintaining cloud infrastructure
  • Experience with SQL/NoSQL databases
  • Ability to work across organizational boundaries

Work Rights

Not specified

Tailored Resume

Cover Letter