Member Of Technical Staff — Kernel / Compiler / Communication

Radixark

Palo Alto, CA, United States
Competitive compensation; meaningful equity includ...
On-site
5+ years systems or compiler engineering experience
Strong cuda or accelerator programming expertise
Deep understanding of gpu architecture and memory hierarchy
RadixArk is seeking a Member of Technical Staff to enhance performance in frontier AI systems by working on kernels, compilers, and communication libraries. The ideal candidate will possess extensive experience in systems programming and GPU architecture to optimize AI workloads

Job Summary

  • RadixArk is seeking a Member of Technical Staff to push the limits of performance for frontier AI systems by working at the lowest layers of the stack.
  • This role is critical to scaling training and inference across thousands of GPUs, where microseconds and memory bandwidth matter directly to the performance envelope.
  • The company offers competitive compensation with meaningful equity, comprehensive benefits, and flexible work arrangements.

Matching Summary

Match Score: 88

RadixArk is seeking a Member of Technical Staff to enhance performance in frontier AI systems by working on kernels, compilers, and communication libraries. The ideal candidate will possess extensive experience in systems programming and GPU architecture to optimize AI workloads.

Salary

Competitive compensation; Meaningful equity included; Comprehensive benefits and flexible work arrangements

Skills & Requirements

Must-have

  • 5+ years systems or compiler engineering experience
  • Strong CUDA or accelerator programming expertise
  • Deep understanding of GPU architecture and memory hierarchy
  • Experience writing or optimizing high-performance kernels
  • Strong background in compilers, runtimes, or code generation
  • Experience with distributed communication libraries like NCCL or MPI
  • Proficiency in C++ and Python programming languages

Nice-to-have

  • Experience with Triton, TVM, XLA, or MLIR frameworks
  • Experience building compiler passes or IR transformations
  • Familiarity with NVLink, InfiniBand, or RDMA technologies
  • Experience optimizing collective communication at scale
  • Background in HPC or performance-critical systems
  • Contributions to kernel/compiler/ML systems open source
  • Experience scaling workloads to 1000+ GPUs

Key Requirements

  • 5+ years of experience in systems, compiler, or performance engineering
  • Solid knowledge of networking and interconnect technologies
  • Strong debugging and profiling skills at system level

Work Rights

Not specified

Tailored Resume

Cover Letter