Staff Software Engineer, Inference Cloud

Cerebras Systems

Sunnyvale, CA, United States
On-site
Distributed systems problems
Multi-region traffic architecture
Graceful degradation under bursty ai workloads
Cerebras Systems is looking for a Staff Software Engineer to lead the architecture of their Inference Cloud Platform in Sunnyvale, CA. The ideal candidate will tackle complex distributed systems challenges while contributing to the development of a globally distributed AI inference platform

Job Summary

  • This team owns the cloud layer behind our Inference Service, with responsibility for availability, latency, reliability, and global scale.
  • This is a hands on IC role for an engineer who wants to work on the hardest distributed systems problems in the stack: multi-region traffic architecture, graceful degradation under bursty AI workloads, performance at high QPS, and the operating model for a platform that has to stay fast and available under load.
  • You'll write code, lead key architectural decisions in your domain, debug production issues, and help shape technical direction across adjacent teams.

Matching Summary

Match Score: 85

Cerebras Systems is looking for a Staff Software Engineer to lead the architecture of their Inference Cloud Platform in Sunnyvale, CA. The ideal candidate will tackle complex distributed systems challenges while contributing to the development of a globally distributed AI inference platform.

Skills & Requirements

Must-have

  • distributed systems problems
  • multi-region traffic architecture
  • graceful degradation under bursty AI workloads
  • performance at high QPS
  • operating model for a platform

Nice-to-have

  • next-generation architecture
  • globally distributed inference platform

Key Requirements

  • Staff Engineer level experience
  • Experience with distributed systems
  • Experience with cloud platforms

Work Rights

Not specified

Tailored Resume

Cover Letter