Senior Ai Infrastructure Engineer - Training Platform
Scale
San Francisco, CA, US
Base: $216,000 - $270,000 usd; equity: subject to ...
On-site
5+ years backend or infrastructure engineering experience
2+ years orchestrating ml workloads at scale
Expert-level kubernetes internals knowledge
You will architect a high-performance training platform that handles the immense complexity of multi-thousand GPU workloads to ensure every cycle is used efficiently
Job Summary
You will architect a high-performance training platform that handles the immense complexity of multi-thousand GPU workloads to ensure every cycle is used efficiently.
The role involves partnering closely with researchers to build a seamless, resilient environment that transforms raw compute into breakthrough AI models.
Compensation packages include base salary ranging from $216,000 to $270,000 USD plus equity and comprehensive benefits.
Matching Summary
You will architect a high-performance training platform that handles the immense complexity of multi-thousand GPU workloads to ensure every cycle is used efficiently.