Senior Machine Learning Engineer, Multimodal Perception (llm/vlm)

Waymo

Mountain View, CA, United States
Base: $213,000 - $263,000 usd; bonus/equity: discr...
On-site
4+ years applied industry experience
Python or c++ fluency with pytorch/jax
Multimodal foundation model deployment experience
Waymo is seeking a Senior Machine Learning Engineer specializing in multimodal perception for their autonomous driving technology team. The ideal candidate will have extensive experience in machine learning, particularly with multimodal and vision-language models, and will contribute to building advanced perception systems for vehicle safety

Job Summary

  • The Semantics team focuses on bringing the immense reasoning power of massive foundation models directly onto the Waymo Driver to handle complex long-tail scenarios.
  • You will architect and train large-scale, onboard ML perception models that are instrumental to ensuring vehicle safety and regulatory compliance.
  • Waymo employees are eligible to participate in a discretionary annual bonus program, equity incentive plan, and generous Company benefits program.

Matching Summary

Match Score: 85

Waymo is seeking a Senior Machine Learning Engineer specializing in multimodal perception for their autonomous driving technology team. The ideal candidate will have extensive experience in machine learning, particularly with multimodal and vision-language models, and will contribute to building advanced perception systems for vehicle safety.

Salary

Base: $213,000 - $263,000 USD; Bonus/Equity: Discretionary annual bonus and equity incentive plan; Benefits: Generous Company benefits program

Skills & Requirements

Must-have

  • 4+ years applied industry experience
  • Python or C++ fluency with PyTorch/Jax
  • Multimodal Foundation Model deployment experience

Nice-to-have

  • PhD in Computer Vision or Machine Learning
  • First-author publications in premier conferences
  • Experience with RLHF for Foundation Models

Key Requirements

  • BS or MS in Computer Vision, Machine Learning, Robotics, or related field
  • Deep understanding of model distillation frameworks and quantization techniques
  • Proven practical experience with Vision-Language Models (VLMs)

Work Rights

Not specified

Tailored Resume

Cover Letter