The Cloud Inference team scales and optimizes Claude to serve massive audiences across major cloud providers, owning the end-to-end product on each platform
Job Summary
The Cloud Inference team scales and optimizes Claude to serve massive audiences across major cloud providers, owning the end-to-end product on each platform.
Engineers on this team are highly leveraged, driving revenue streams while optimizing compute resources and navigating complex multi-cloud environments.
Work will increase service scale, accelerate new model launches, and ensure LLMs meet rigorous safety, performance, and security standards across all platforms.
Matching Summary
The Cloud Inference team scales and optimizes Claude to serve massive audiences across major cloud providers, owning the end-to-end product on each platform.
Salary
$300,000—$485,000 USD
Skills & Requirements
Must-have
Large-scale distributed systems serving
Cloud platform experience (AWS, GCP, Azure)
Kubernetes and container orchestration
Infrastructure as Code
CI/CD automation systems design
Cost-effective inference management
Capacity planning and autoscaling
Nice-to-have
Cross-functional collaboration with partners
Fast learner on new technologies
Highly autonomous and self-driven
Platform-agnostic tooling development
LLM inference optimization strategies
Machine learning infrastructure experience
Key Requirements
Significant software engineering experience
Experience building/operating services on major cloud platforms