Not specified (likely hybrid or onsite based on industry standards).
Deep learning operator optimization
Gpu cpu lpu performance modeling
C c++ perl python programming
NVIDIA is seeking an AI Computing Performance Architect to enhance the performance of deep learning applications by conducting performance analysis and developing kernels for their architectures. The ideal candidate will possess a strong background in computer architecture and programming, with at least three years of relevant experience
Job Summary
This role involves conducting in-depth performance analysis to develop kernels for NVIDIA's latest architectures.
The successful candidate will identify bottlenecks and devise creative software solutions to maximize deep learning performance.
Contributions are pivotal in advancing the hardware and software that power next-generation AI applications.
Matching Summary
Match Score: 85
NVIDIA is seeking an AI Computing Performance Architect to enhance the performance of deep learning applications by conducting performance analysis and developing kernels for their architectures. The ideal candidate will possess a strong background in computer architecture and programming, with at least three years of relevant experience.
Skills & Requirements
Must-have
Deep learning operator optimization
GPU CPU LPU performance modeling
C C++ Perl Python programming
Computer architecture foundation
Assembly or SIMD programming
Nice-to-have
LLM frameworks knowledge
CUDA or OpenCL experience
Parallel programming expertise
Strong communication skills
Organizational skills
Key Requirements
MS or PhD in Computer Science, Electrical Engineering, or Mathematics
At least 3 years of professional experience with performance modeling