The team builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI by connecting accelerators, CPUs, networks, and storage
Job Summary
The team builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI by connecting accelerators, CPUs, networks, and storage.
Engineers will design, provision, schedule, operate, and optimize systems ranging from bare-metal automation to high-performance networking and fleet health.
Candidates are expected to reason carefully about complex systems, write durable software, and raise the quality and velocity of the people around them.
Matching Summary
The team builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI by connecting accelerators, CPUs, networks, and storage.
Skills & Requirements
Must-have
Distributed systems experience
High-performance computing knowledge
Kubernetes and scheduling expertise
Hardware-aware performance optimization
Complex system debugging skills
Nice-to-have
Strong engineering judgment
Excitement about frontier AI
Ability to work across stack layers
Bias toward practical durable solutions
Interest in developer experience
Key Requirements
Strong software engineering skills
Experience building production infrastructure systems