Senior Systems Engineer – High-performance Ai And Networking Applications

NVIDIA

Base: 184,000 usd - 287,500 usd (level 4); 224,000...
Nvlink, nvswitch, and infiniband powered infrastructures
Ai/hpc job schedulers and orchestrators
Mpi and nccl workflows
Collaborate with networking teams to plan, implement, and evaluate performance benchmarks on NVLINK, NVSwitch, and InfiniBand powered infrastructures

Job Summary

  • Collaborate with networking teams to plan, implement, and evaluate performance benchmarks on NVLINK, NVSwitch, and InfiniBand powered infrastructures.
  • Act as a primary resource for fixing networking and hardware integration issues, focusing on scalable multi-node systems.
  • Offer technical mentorship and documentation for internal teams and external partners on standard methodologies in HPC networking deployments.

Matching Summary

Collaborate with networking teams to plan, implement, and evaluate performance benchmarks on NVLINK, NVSwitch, and InfiniBand powered infrastructures.

Salary

Base: 184,000 USD - 287,500 USD (Level 4); 224,000 USD - 356,500 USD (Level 5); Equity: eligible; Benefits: eligible

Skills & Requirements

Must-have

  • NVLINK, NVSwitch, and InfiniBand powered infrastructures
  • AI/HPC job schedulers and orchestrators
  • MPI and NCCL workflows
  • High-Speed Networking (InfiniBand, RDMA, RoCE, EFA)
  • Deep Learning Inference frameworks
  • multi-node systems performance investigation

Nice-to-have

  • datacenter automation
  • advanced network protocols
  • large HPC or AI clusters production support
  • distributed storage systems
  • cluster management and monitoring tools

Key Requirements

  • 8+ years of proven experience in AI/HPC Infrastructure
  • BS/MS or PhD in Computer Science, Engineering, or related field, or equivalent experience
  • Familiarity with InfiniBand, NVLINK, and high-speed networking

Work Rights

Not specified

Tailored Resume

Cover Letter