Senior System Software Engineer, Nccl - Partner Enablement
Invidia
Multiple Locations
Base: 152,000 usd - 241,500 usd for level 3, 184,0...
Parallel programming with communication runtimes
C/c++ programming and performance analysis
High performance networking (infiniband, roce, ethernet)
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization
Job Summary
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization.
You will engage with partners and customers to root cause functional and performance issues reported with NCCL and develop tools and automation for new systems including cloud platforms.
The role offers a competitive base salary range, equity, benefits, and the opportunity to contribute to innovative AI networking technologies.
Matching Summary
NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization.
Salary
Base: 152,000 USD - 241,500 USD for Level 3, 184,000 USD - 287,500 USD for Level 4; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits
Skills & Requirements
Must-have
Parallel programming with communication runtimes
C/C++ programming and performance analysis
High performance networking (Infiniband, RoCE, Ethernet)
Linux fundamentals and scripting (Python)
Multi-node cluster HPC application support
Nice-to-have
Performance benchmarking on HPC clusters
System administration for large clusters
Network configuration debugging
CUDA programming and GPU familiarity
Machine Learning and Deep Learning frameworks knowledge
Adaptability and cross-team communication
Key Requirements
B.S./M.S. degree in CS/CE or equivalent experience
5+ years of relevant experience
Experience with MPI, NCCL, UCX, or NVSHMEM
Experience with HPC or AI research community
Familiarity with Docker, Kubernetes, SLURM, Ansible