AI Computing Performance Architect

NVIDIA Corporation

Not specified (likely hybrid or onsite based on industry standards).
Deep learning operator optimization
GPU, CPU, LPU performance modeling
C, C++, Perl, Python programming
NVIDIA is seeking an AI Computing Performance Architect to enhance the performance of deep learning applications by conducting performance analysis and developing kernels for its architectures. The ideal candidate will possess a strong background in computer architecture and programming, with at least three years of relevant experience.

Job Summary

  • This role involves conducting in-depth performance analysis to develop kernels for NVIDIA's latest architectures.
  • The successful candidate will identify bottlenecks and devise creative software solutions to maximize deep learning performance.
  • Contributions are pivotal in advancing the hardware and software that power next-generation AI applications.

Matching Summary

Match Score: 85


Skills & Requirements

Must-have

  • Deep learning operator optimization
  • GPU, CPU, LPU performance modeling
  • C, C++, Perl, Python programming
  • Computer architecture foundation
  • Assembly or SIMD programming

Nice-to-have

  • LLM frameworks knowledge
  • CUDA or OpenCL experience
  • Parallel programming expertise
  • Strong communication skills
  • Organizational skills

Key Requirements

  • MS or PhD in Computer Science, Electrical Engineering, or Mathematics
  • At least 3 years of professional experience with performance modeling
  • Hands-on assembly or SIMD programming experience

Work Rights

Not specified
