Member Of Technical Staff - Inference

x.ai

Palo Alto, CA, United States
Base: $180,000 - $440,000 usd; bonus/equity: equit...
On-site
System optimizations for model serving
Low-level optimizations for inference
Algorithmic optimizations for inference
x.ai is seeking a Member of Technical Staff for their inference team in Palo Alto, CA. The role involves optimizing model inference for high-performance production systems, requiring strong technical expertise in system and algorithmic optimizations

Job Summary

  • xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
  • Optimizing the latency and throughput of model inference and building reliable and performant production serving systems to serve billions of users.
  • Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Matching Summary

Match Score: 85

x.ai is seeking a Member of Technical Staff for their inference team in Palo Alto, CA. The role involves optimizing model inference for high-performance production systems, requiring strong technical expertise in system and algorithmic optimizations.

Salary

Base: $180,000 - $440,000 USD; Bonus/Equity: equity; Benefits: comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks

Skills & Requirements

Must-have

  • system optimizations for model serving
  • low-level optimizations for inference
  • algorithmic optimizations for inference
  • large-scale inference engines
  • large-scale production serving
  • testing inference services

Nice-to-have

  • engineering excellence
  • strong prioritization skills
  • strong communication skills
  • hands-on contribution

Key Requirements

  • System optimizations for model serving
  • Low-level optimizations for inference
  • Algorithmic optimizations for inference
  • Large-scale inference engines or RL frameworks
  • Large-scale, high-concurrent production serving
  • Testing, benchmarking, and reliability of inference services

Work Rights

Not specified

Tailored Resume

Cover Letter