Member Of Technical Staff - Inference

xgrok-ai.ru

Palo Alto, CA, United States
On-site
xAI is seeking a Member of Technical Staff specializing in inference to enhance the performance and reliability of AI model serving systems. The role requires deep expertise in system optimizations and algorithmic improvements for large-scale production environments.

Job Summary

  • xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.
  • Responsibilities include optimizing the latency and throughput of model inference and building reliable and performant production serving systems.
  • Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, and various other discounts and perks.

Matching Summary

Match Score: 85

Salary

$180,000 - $440,000 USD; Not specified; Benefits included

Skills & Requirements

Must-have

  • Optimizing model inference latency and throughput
  • Building production serving systems
  • Low-level inference optimizations
  • Algorithmic inference optimizations
  • Large-scale inference engines
  • High-concurrency production serving

Nice-to-have

  • Thrive on curiosity
  • Flat organizational structure
  • Hands-on contribution
  • Strong prioritization skills
  • Strong communication skills

Key Requirements

  • System optimizations for model serving
  • Low-level optimizations for inference
  • Algorithmic optimizations for inference
  • Large-scale inference engines or RL frameworks
  • Large-scale, high-concurrency production serving
  • Testing, benchmarking, and reliability of inference services

Work Rights

Not specified
