Stack AV is seeking a Staff Software Engineer for their ML Acceleration team, focusing on optimizing machine learning models for autonomous systems in the trucking industry. The ideal candidate will have extensive experience in GPU programming and optimization, along with proficiency in C++ and Python
Job Summary
The ML Training Acceleration team's core mission is to deliver a training system that is reliable, scalable, user-friendly and observable.
Responsibilities include analyzing ML models to identify and resolve performance bottlenecks and incorporating OSS tools to enable ML engineers self-sufficiently profile and optimize models.
The role requires implementing optimizations using CUDA, Triton, and custom kernels, and promoting Engineering Excellence within the team.
Matching Summary
Match Score: 85
Stack AV is seeking a Staff Software Engineer for their ML Acceleration team, focusing on optimizing machine learning models for autonomous systems in the trucking industry. The ideal candidate will have extensive experience in GPU programming and optimization, along with proficiency in C++ and Python.
Skills & Requirements
Must-have
GPU programming and optimization
PyTorch optimization techniques
CUDA programming
Triton language for GPU kernels
Custom GPU kernel development
Nice-to-have
Autonomous vehicles (AV) experience
Key Requirements
5+ years of experience
Bachelor’s or Master’s degree
Strong programming skills in C++ and Python
Proven experience in GPU programming and optimization
Familiarity with deep learning frameworks, especially PyTorch
TensorRT implementation
ONNX model conversion and deployment
Deep understanding of GPU architectures and performance optimization
Work Rights
May require verification of residence, US person status, and/or citizenship status due to national security laws and export control regulations.