Member Of Technical Staff - Inference

xAI

Palo Alto, CA, United States
Base: $180,000 - $440,000 usd; bonus/equity: equit...
On-site
Model inference optimization
Production serving systems
Low-level inference optimizations
xAI is seeking a Member of Technical Staff - Inference to optimize model inference systems and enhance production serving capabilities. The ideal candidate will possess extensive experience in system and algorithmic optimizations for large-scale inference engines. The position offers a competitive salary and a comprehensive benefits package, reflecting the company's commitment to employee well-being

Job Summary

  • xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
  • Optimizing the latency and throughput of model inference and building reliable and performant production serving systems to serve billions of users.
  • Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Matching Summary

Match Score: 85

xAI is seeking a Member of Technical Staff - Inference to optimize model inference systems and enhance production serving capabilities. The ideal candidate will possess extensive experience in system and algorithmic optimizations for large-scale inference engines. The position offers a competitive salary and a comprehensive benefits package, reflecting the company's commitment to employee well-being.

Salary

Base: $180,000 - $440,000 USD; Bonus/Equity: equity; Benefits: comprehensive medical, vision, and dental coverage, 401(k), disability insurance, life insurance, perks

Skills & Requirements

Must-have

  • model inference optimization
  • production serving systems
  • low-level inference optimizations
  • large-scale inference engines
  • high-concurrent production serving

Nice-to-have

  • engineering excellence
  • flat organizational structure
  • strong prioritization skills
  • strong communication skills
  • research acceleration

Key Requirements

  • system optimizations for model serving
  • low-level optimizations for inference
  • algorithmic optimizations for inference
  • large-scale inference engines or RL frameworks
  • large-scale, high-concurrent production serving
  • testing, benchmarking, and reliability

Work Rights

Not specified

Tailored Resume

Cover Letter