Distinguished Engineer, Cloud Site Reliability Engineering

Invidia

CA, United States
Base: 320,000 usd - 488,750 usd; bonus/equity: eli...
**
Cloud infrastructure maintenance
Distributed systems and rest apis
Docker containers and virtual machines
** NVIDIA is seeking a Distinguished Engineer for their Cloud Site Reliability Engineering (SRE) team, focusing on the development and optimization of cloud infrastructure used by internal teams. The ideal candidate will have extensive experience in software development, cloud technologies, and a strong background in AI, along with leadership skills to guide engineering teams. **

Job Summary

  • NVIDIA's Cloud SRE Architect role involves working with global teams to optimize infrastructure for thousands of software engineers worldwide.
  • The position includes architecting and supporting end-to-end CI/CD systems and improving performance and cost efficiency of AI development and testing systems.
  • Candidates will lead software development projects and implement critical metrics using analytics and dashboards.

Matching Summary

Match Score: 75

** NVIDIA is seeking a Distinguished Engineer for their Cloud Site Reliability Engineering (SRE) team, focusing on the development and optimization of cloud infrastructure used by internal teams. The ideal candidate will have extensive experience in software development, cloud technologies, and a strong background in AI, along with leadership skills to guide engineering teams. **

Salary

Base: 320,000 USD - 488,750 USD; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • Cloud infrastructure maintenance
  • Distributed systems and REST APIs
  • Docker containers and Virtual Machines
  • CI/CD system architecture and support
  • Programming in JAVA, Python, Shell-script
  • SQL/NoSQL database systems
  • Cloud technologies like Kubernetes and OpenStack

Nice-to-have

  • AI, Machine Learning and Deep Learning expertise
  • Strong collaborative and interpersonal skills
  • Experience with modular architecture in large-scale software
  • Hardware cost optimization focus
  • Leading and guiding engineering teams

Key Requirements

  • BS EE/CS or equivalent experience
  • 18+ years systems software development
  • At least 1 year AI development experience
  • Experience maintaining highly available production environments
  • Ability to work across multinational teams

Work Rights

Not specified

Tailored Resume

Cover Letter