Develop high-performance software solutions at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads
Job Summary
Develop high-performance software solutions at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads.
Implement, optimize, and scale deep learning operations to fully leverage our custom, massively parallel processor architecture.
Build a library of parallel and distributed algorithms that maximize compute utilization and push the boundaries of training efficiency for state-of-the-art AI models.
Matching Summary
Develop high-performance software solutions at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads.
Skills & Requirements
Must-have
C++ and Python programming
low-level assembly and CSL
parallel programming algorithms
performance tuning and validation
deep learning operations optimization
Nice-to-have
experience in kernel development
familiarity with ML frameworks
programming accelerators
HPC kernels optimization
understanding of hardware architecture
Key Requirements
Bachelor’s, Master’s, PhD or foreign equivalents
Computer Science, Computer Engineering, Mathematics, or related fields