Member Of Technical Staff - Inference

xAI

Palo Alto, CA, United States
$180,000 - $440,000 usd; not specified; base salar...
On-site
Optimizing model inference latency
Building production serving systems
Low-level inference optimizations
xAI is seeking a Member of Technical Staff focused on optimizing AI model inference in a flat organizational structure that encourages initiative and hands-on contributions. The ideal candidate will have experience in system and algorithmic optimizations for large-scale inference engines and production systems. This position offers competitive compensation and a comprehensive benefits package

Job Summary

  • xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
  • Responsibilities include optimizing the latency and throughput of model inference and building reliable and performant production serving systems.
  • The total rewards package includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, and other benefits.

Matching Summary

Match Score: 85

xAI is seeking a Member of Technical Staff focused on optimizing AI model inference in a flat organizational structure that encourages initiative and hands-on contributions. The ideal candidate will have experience in system and algorithmic optimizations for large-scale inference engines and production systems. This position offers competitive compensation and a comprehensive benefits package.

Salary

$180,000 - $440,000 USD; Not specified; Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Skills & Requirements

Must-have

  • optimizing model inference latency
  • building production serving systems
  • low-level inference optimizations
  • large-scale inference engines
  • high-concurrent production serving
  • testing inference services reliability

Nice-to-have

  • engineering excellence focus
  • flat organizational structure
  • hands-on contribution expected
  • strong prioritization skills
  • concise knowledge sharing

Key Requirements

  • system optimizations for model serving
  • low-level optimizations for inference
  • algorithmic optimizations for inference
  • large-scale inference engines or RL frameworks
  • large-scale, high-concurrent production serving
  • testing, benchmarking, reliability of inference services

Work Rights

Not specified

Tailored Resume

Cover Letter