Senior System Software Engineer - AI Performance and Efficiency Tools

NVIDIA

US, CA, United States
Base: 184,000 USD - 287,500 USD for Level 4, 224,0...
Hybrid
C++ and Python software development
Deep learning frameworks knowledge
GPU cluster job scheduling experience
This role involves building sophisticated tools that empower NVIDIA engineers to improve performance and power efficiency of AI workloads on GPU clusters.

Job Summary

  • This role involves building sophisticated tools that empower NVIDIA engineers to improve performance and power efficiency of AI workloads on GPU clusters.
  • You will collaborate with hardware architects and software teams to propose and implement new features based on real-world use cases.
  • The position offers a competitive salary range, equity, benefits, and the opportunity to work with global teams in a hybrid work environment.

Salary

Base: 184,000 USD - 287,500 USD for Level 4, 224,000 USD - 356,500 USD for Level 5; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • C++ and Python software development
  • Deep Learning frameworks knowledge
  • GPU cluster job scheduling experience
  • NVIDIA GPUs and CUDA programming
  • AI workload profiling and analysis

Nice-to-have

  • Strong problem-solving skills
  • Customer-facing communication skills
  • Passion for continuous learning
  • Experience with Linux device drivers
  • Knowledge of GPU and CPU architecture

Key Requirements

  • BS or higher in Computer Science or a related field
  • 5+ years software development experience
  • Experience with PyTorch and TensorFlow
  • Knowledge of Slurm or Kubernetes scheduling
  • Experience with CUDA and NCCL
  • Ability to work with multiple global groups

Work Rights

Not specified
