Software Engineer, Kernel Reliability

Cerebras Systems

Sunnyvale, California, USA
On-site
Strong c/c++ and python programming
Operating systems fundamentals
Computer architecture fundamentals
Cerebras Systems is seeking a Software Engineer for their Kernel Reliability team to improve the reliability of advanced compute clusters and internal services. The ideal candidate should have strong programming skills in C/C++ and Python, with a foundation in operating systems and debugging

Job Summary

  • Contribute to the technical roadmap and execution for kernel-centric reliability of our internal and customer-facing systems.
  • Partner with System and Cluster Operations teams to reduce system and service downtime after failure through tooling, analysis, and hands-on debugging support.
  • Work with ASIC and hardware architecture teams to co-design next-generation architectures with reliability and ease of debug in mind.

Matching Summary

Match Score: 85

Cerebras Systems is seeking a Software Engineer for their Kernel Reliability team to improve the reliability of advanced compute clusters and internal services. The ideal candidate should have strong programming skills in C/C++ and Python, with a foundation in operating systems and debugging.

Skills & Requirements

Must-have

  • strong C/C++ and Python programming
  • operating systems fundamentals
  • computer architecture fundamentals
  • systems programming fundamentals
  • debugging complex issues
  • root-cause analysis

Nice-to-have

  • parallel and distributed programming
  • building debug/diagnostic tools
  • debugging distributed applications
  • computer architecture concepts
  • incident response and monitoring

Key Requirements

  • New college graduates welcome

Work Rights

Not specified

Tailored Resume

Cover Letter