Software Engineer, Compute Infrastructure

OpenAI

San Francisco, United States
Remote
Distributed systems experience
High-performance computing knowledge
Kubernetes and scheduling expertise
The team builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI by connecting accelerators, CPUs, networks, and storage

Job Summary

  • The team builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI by connecting accelerators, CPUs, networks, and storage.
  • Engineers will design, provision, schedule, operate, and optimize systems ranging from bare-metal automation to high-performance networking and fleet health.
  • Candidates are expected to reason carefully about complex systems, write durable software, and raise the quality and velocity of the people around them.

Matching Summary

The team builds the platform that turns enormous amounts of compute into a reliable engine for frontier AI by connecting accelerators, CPUs, networks, and storage.

Skills & Requirements

Must-have

  • Distributed systems experience
  • High-performance computing knowledge
  • Kubernetes and scheduling expertise
  • Hardware-aware performance optimization
  • Complex system debugging skills

Nice-to-have

  • Strong engineering judgment
  • Excitement about frontier AI
  • Ability to work across stack layers
  • Bias toward practical durable solutions
  • Interest in developer experience

Key Requirements

  • Strong software engineering skills
  • Experience building production infrastructure systems
  • Ability to debug complex system behavior

Work Rights

Not specified

Tailored Resume

Cover Letter