Senior System Software Engineer - GPU Performance

NVIDIA

Multiple Locations

Job Summary

  • NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization.
  • This is an outstanding opportunity for someone with an HPC and performance background to advance the state of the art in this space.
  • You will collaborate with a very dynamic team across multiple time zones and contribute to innovative technologies.

Salary

Base: 152,000 USD - 241,500 USD for Level 3; 184,000 USD - 287,500 USD for Level 4; Bonus/Equity: Eligible; Benefits: Eligible

Skills & Requirements

Must-have

  • Performance benchmarking on large HPC clusters
  • Parallel programming with communication runtimes
  • Debugging across HW/SW stack
  • Proficiency in C/C++ and Python scripting
  • Familiarity with containers and cloud tools

Nice-to-have

  • Experience with InfiniBand/Ethernet networks
  • Knowledge of CUDA programming and GPUs
  • Familiarity with Deep Learning frameworks
  • Ability to work across multiple time zones
  • Adaptability and passion to learn new tools

Key Requirements

  • M.S. or Ph.D. in Computer Science or a related field
  • 3+ years of experience with parallel programming
  • Experience with MPI, NCCL, UCX, or NVSHMEM
  • Strong understanding of computer system architecture
  • Experience debugging performance issues
  • Familiarity with Kubernetes, SLURM, Ansible, Docker

Work Rights

Not specified
