Staff / Senior Software Engineer, Cloud Inference

Anthropic

San Francisco, CA, USA
$300,000–$485,000 USD per year
On-site

Job Summary

  • The Cloud Inference team scales and optimizes Claude to serve massive audiences across major cloud providers, owning the end-to-end product on each platform.
  • Engineers on this team are highly leveraged, driving revenue streams while optimizing compute resources and navigating complex multi-cloud environments.
  • Work will increase service scale, accelerate new model launches, and ensure LLMs meet rigorous safety, performance, and security standards across all platforms.

Salary

$300,000–$485,000 USD

Skills & Requirements

Must-have

  • Large-scale distributed systems serving
  • Cloud platform experience (AWS, GCP, Azure)
  • Kubernetes and container orchestration
  • Infrastructure as Code
  • CI/CD automation systems design
  • Cost-effective inference management
  • Capacity planning and autoscaling

Nice-to-have

  • Cross-functional collaboration with partners
  • Fast learner on new technologies
  • Highly autonomous and self-driven
  • Platform-agnostic tooling development
  • LLM inference optimization strategies
  • Machine learning infrastructure experience

Key Requirements

  • Significant software engineering experience
  • Experience building/operating services on major cloud platforms
  • Strong interest in inference
  • Bachelor’s degree or equivalent experience

Work Rights

Not specified

Sponsorship: available
