Staff Systems Software Engineer- Server

Invidia

Multiple Locations
Gpu diagnostics development
C/c++ and python programming
Linux low-level system software
NVIDIA leads the world in AI infrastructure with GPUs powering advanced AI systems and data centers

Job Summary

  • NVIDIA leads the world in AI infrastructure with GPUs powering advanced AI systems and data centers.
  • The role involves leading the design and integration of software-based GPU diagnostics into manufacturing and datacenter workflows.
  • Candidates will work closely with hardware, firmware, and software teams to debug and improve diagnostics, contributing to technical documentation and automation.

Matching Summary

NVIDIA leads the world in AI infrastructure with GPUs powering advanced AI systems and data centers.

Skills & Requirements

Must-have

  • GPU diagnostics development
  • C/C++ and Python programming
  • Linux low-level system software
  • Server platform architecture knowledge
  • Debugging with gdb and perf
  • Cross-functional technical leadership

Nice-to-have

  • Experience with diagnostic tools in factory/datacenter
  • Defining diagnostic or RMA qualification flows
  • Writing high-coverage test plans
  • Using AI-assisted diagnostic tools
  • Collaborative mindset with global teams

Key Requirements

  • BS or MS in Computer Science or related field
  • 8+ years experience in system software or diagnostics
  • Strong programming skills in C/C++ and Python
  • Experience with Linux system software development
  • Understanding of x86 and ARM server architectures
  • Demonstrated debugging and problem-solving skills

Work Rights

Not specified

Tailored Resume

Cover Letter