Senior Dgx Cloud Ai Infrastructure Software Engineer

Nvidia Corporation

Base: $184,000 - $287,500 (level 4) or $224,000 - ...
**
8+ years software infrastructure experience
Large scale distributed systems development
Python c/c++ programming proficiency
** NVIDIA Corporation is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to join their AI Efficiency Team, focusing on optimizing AI workloads and ensuring a stable, scalable environment for AI researchers. The role requires extensive experience in developing infrastructure for large-scale AI systems and offers a supportive culture of innovation and collaboration. **

Job Summary

  • The role involves designing and building infrastructure that enables large-scale AI pre-training, post-training, and inference workloads.
  • Candidates will define reliability metrics and optimize tools to ensure high efficiency and resiliency of NVIDIA's AI platforms.
  • The position offers autonomy to work on meaningful projects within a supportive team culture that values learning and iterative improvement.

Matching Summary

Match Score: 75

** NVIDIA Corporation is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to join their AI Efficiency Team, focusing on optimizing AI workloads and ensuring a stable, scalable environment for AI researchers. The role requires extensive experience in developing infrastructure for large-scale AI systems and offers a supportive culture of innovation and collaboration. **

Salary

Base: $184,000 - $287,500 (Level 4) or $224,000 - $356,500 (Level 5); Equity: Eligible; Benefits: Eligible

Skills & Requirements

Must-have

  • 8+ years software infrastructure experience
  • Large scale distributed systems development
  • Python C/C++ programming proficiency
  • AI training and inferencing infrastructure
  • Observability platforms ELK Prometheus Loki

Nice-to-have

  • RDMA software stack NCCL IB verbs
  • PyTorch TensorFlow JAX internal understanding
  • Datacenter scale failure root cause analysis
  • Culture of blameless postmortems and risk-taking

Key Requirements

  • Minimum 8+ years developing software infrastructure
  • Bachelor's degree in Computer Science or related field
  • Strong debugging skills from application to hardware level

Work Rights

Not specified

Tailored Resume

Cover Letter