Senior Software Engineer, Cloud-native Stack – Csp Engagements

Invidia

Multiple Locations
Base: 184,000 usd - 287,500 usd for level 4, 224,0...
Kubernetes internals expertise
Slurm scheduler and plugins
Multi-rack multi-tenant cluster debugging
You will define customer workflows, prototype stack enhancements, and debug complex Kubernetes and Slurm issues in multi-rack, multi-tenant AI datacenters

Job Summary

  • You will define customer workflows, prototype stack enhancements, and debug complex Kubernetes and Slurm issues in multi-rack, multi-tenant AI datacenters.
  • The role involves driving architecture reviews, creating reproducible testbeds, and delivering technical collateral including design docs and demo scripts.
  • NVIDIA offers competitive base salaries with equity and benefits, and fosters a diverse and inclusive work environment.

Matching Summary

You will define customer workflows, prototype stack enhancements, and debug complex Kubernetes and Slurm issues in multi-rack, multi-tenant AI datacenters.

Salary

Base: 184,000 USD - 287,500 USD for Level 4, 224,000 USD - 356,500 USD for Level 5; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • Kubernetes internals expertise
  • Slurm scheduler and plugins
  • Multi-rack multi-tenant cluster debugging
  • GPU integration in containerized clusters
  • CI/CD and infrastructure-as-code familiarity
  • Distributed systems software development

Nice-to-have

  • Upstream open source contributions
  • GPU computing and CUDA experience
  • Customer-facing engineering background
  • Excellent communication skills
  • Collaboration with cross-functional teams

Key Requirements

  • 6+ years professional software development
  • BS or MS in Computer Science or related field
  • Experience with Go, Rust, C/C++, or Python
  • Customer requirements gathering and PoC ownership
  • Experience with RDMA/RoCE networking
  • Not specified

Work Rights

Not specified

Tailored Resume

Cover Letter