AWS designs custom SoCs that power the world's largest machine learning training and inference clusters
Job Summary
AWS designs custom SoCs that power the world's largest machine learning training and inference clusters.
You will debug complex hardware/software interactions across the full software stack, from register-level bring-up to performance analysis on live silicon.
You'll work on software that runs on chips no one outside the team has seen yet, solving problems that don't have Stack Overflow answers.
Matching Summary
AWS designs custom SoCs that power the world's largest machine learning training and inference clusters.
Skills & Requirements
Must-have
SoC models development
Hardware/software interaction debugging
Low-level software development
C++ close to hardware
Python for tooling and automation
Performance profiling and optimization
Nice-to-have
Experience with ML accelerators
Architectural exploration
Small, high-impact team environment
Key Requirements
Experience building firmware, drivers, runtime software, or communication libraries
Comfortable reading hardware specs and translating them into software