ML Platform Engineer

avride.ai

Austin, TX, United States
On-site

Job Summary

  • The ML Platform team builds infrastructure for large-scale ML training and data processing for autonomous driving.
  • You will own critical pieces of the ML stack, shaping how ML teams run experiments and train models at scale.
  • You will build abstractions and services to make training workloads reliable, cost-efficient, and fast on Kubernetes.

Skills & Requirements

Must-have

  • Proficiency in Python or Go
  • Deep knowledge of Kubernetes
  • Scalable systems design
  • Operating production services
  • Linux systems debugging
  • Troubleshooting complex production issues

Nice-to-have

  • Argo Workflows, Ray, MLflow experience
  • Large-scale ML training systems
  • Optimizing distributed resource usage

Key Requirements

  • Authorized to work in the U.S.
