Senior Staff Machine Learning Engineer, Data & Eval

Airbnb

United States, US
Base: $244,000—$305,000 usd; bonus/equity: not spe...
Hybrid (us - remote eligible with occasional on-site work)
Genai system evaluation strategy
Data flywheel design and implementation
Ml model productionization at scale
Airbnb is seeking a Senior Staff Machine Learning Engineer for their Core ML team, responsible for developing and optimizing AI models and systems to enhance customer service experiences. The ideal candidate will have extensive experience in machine learning, particularly with generative AI, and will play a crucial role in shaping evaluation strategies and technical direction

Job Summary

  • Set technical direction and lead execution for ML evaluation and the end-to-end data flywheel powering CSxAI products.
  • Your work will define how we measure quality, how we turn feedback into learning signals, and how we continuously improve models and products safely and efficiently.
  • Partner closely with product, engineering, design, operations to build evaluation systems that are trusted, scalable, and actionable.

Matching Summary

Match Score: 85

Airbnb is seeking a Senior Staff Machine Learning Engineer for their Core ML team, responsible for developing and optimizing AI models and systems to enhance customer service experiences. The ideal candidate will have extensive experience in machine learning, particularly with generative AI, and will play a crucial role in shaping evaluation strategies and technical direction.

Salary

Base: $244,000—$305,000 USD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • GenAI system evaluation strategy
  • Data flywheel design and implementation
  • ML model productionization at scale
  • Cross-functional quality initiatives leadership
  • LLM fine-tuning and optimization

Nice-to-have

  • Customer support workflow AI application
  • Agile practice for applied AI
  • Continuous learning of new techniques

Key Requirements

  • PhD in Computer Science, Mathematics, Statistics, or related technical field (or equivalent practical experience)
  • 10+ years building, testing, and shipping ML/AI systems end-to-end
  • 2+ years of experience with GenAI/LLM systems in production
  • 5+ years leading large, ambiguous technical initiatives as a senior IC
  • Deep expertise in evaluation methodology
  • Hands-on experience with GenAI systems
  • Experience building data pipelines and quality systems

Work Rights

Not specified

Tailored Resume

Cover Letter