Senior Deep Learning Systems Engineer, Datacenters

NVIDIA

Multiple Locations
Base: 184,000 USD - 287,500 USD for Level 4, 224,0...
Hybrid
Deep learning application performance analysis
System software development with Linux and CUDA
Programming in C++ and Python

Job Summary

  • The role involves helping develop software infrastructure to characterize and analyze a broad range of Deep Learning applications and evolve cost-efficient datacenter architectures tailored for Large Language Models.
  • Candidates will work with experts to develop analysis and profiling tools in Python, bash, and C++ that measure key performance metrics of DL workloads on NVIDIA systems and influence next-generation systems and software stacks.
  • NVIDIA is recognized as a highly desirable employer with a forward-thinking and hardworking culture, valuing creativity and autonomy in its employees.

Salary

Base: 184,000 USD - 287,500 USD for Level 4, 224,000 USD - 356,500 USD for Level 5; Bonus/Equity: Eligible for equity; Benefits: Eligible for benefits

Skills & Requirements

Must-have

  • Deep Learning application performance analysis
  • System software development with Linux and CUDA
  • Programming in C++ and Python
  • Datacenter architecture optimization
  • Profiling tools development and usage

Nice-to-have

  • Experience with containerization platforms like Docker
  • Knowledge of datacenter workload managers such as Slurm
  • Multi-site and multi-functional team collaboration
  • Strong drive for task ownership
  • Background in GPU kernels and DL frameworks

Key Requirements

  • Bachelor’s degree in Electrical Engineering or Computer Science or equivalent experience
  • 8 years or more of relevant experience
  • Experience with system software, operating systems, compilers, GPU kernels, or DL frameworks
  • Programming experience in C++ and Python
  • Understanding of computer system architecture and performance analysis

Work Rights

Not specified
