Member Of Technical Staff - Applied Inference

xAI

Palo Alto, CA, United States
Base: $180,000 - $440,000 usd; bonus/equity: equit...
**
Large-scale production serving
Gpu inference engines
Inference services reliability
** xAI is seeking a Member of Technical Staff specialized in Applied Inference to join their Palo Alto team. The ideal candidate will have experience in large-scale production serving, GPU inference engines, and CI/CD infrastructure, and will contribute to the company's mission of developing advanced AI systems. **

Job Summary

  • xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
  • Architect and implement scalable distributed infrastructure for model serving, such as load balancing, auto scaling, batch scheduling, and global KVcache systems.
  • Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

Matching Summary

Match Score: 75

** xAI is seeking a Member of Technical Staff specialized in Applied Inference to join their Palo Alto team. The ideal candidate will have experience in large-scale production serving, GPU inference engines, and CI/CD infrastructure, and will contribute to the company's mission of developing advanced AI systems. **

Salary

Base: $180,000 - $440,000 USD; Bonus/Equity: equity; Benefits: comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks

Skills & Requirements

Must-have

  • large-scale production serving
  • GPU inference engines
  • inference services reliability
  • CI/CD infrastructure implementation

Nice-to-have

  • engineering excellence
  • strong prioritization skills
  • strong communication skills
  • thrive on curiosity

Key Requirements

  • Worked on large-scale, high-concurrent production serving
  • Worked on GPU inference engines
  • Worked on testing, benchmarking, and the reliability of inference services
  • Worked on designing and implementing CI/CD infrastructure

Work Rights

Not specified

Tailored Resume

Cover Letter