Senior Machine Learning Infrastructure Engineer

PlusAI

Santa Clara, CA, US
On-site
Scalable architecture design
Large-scale gpu cluster management
Docker container orchestration
PlusAI is seeking a Senior Machine Learning Infrastructure Engineer to design scalable architectures for autonomous truck technology. The ideal candidate will possess expertise in managing GPU clusters and building robust ML pipelines, while thriving in a dynamic, innovative environment

Job Summary

  • PlusAI is pioneering AI-based virtual driver software for factory-built autonomous trucks in partnership with major automotive brands.
  • The role involves designing scalable architectures capable of handling petabytes of data while ensuring optimal performance for training and inference.
  • Candidates will build robust pipelines for model versioning and experiment tracking to maintain reproducibility across experiments.

Matching Summary

Match Score: 85

PlusAI is seeking a Senior Machine Learning Infrastructure Engineer to design scalable architectures for autonomous truck technology. The ideal candidate will possess expertise in managing GPU clusters and building robust ML pipelines, while thriving in a dynamic, innovative environment.

Skills & Requirements

Must-have

  • Scalable architecture design
  • Large-scale GPU cluster management
  • Docker container orchestration
  • Kubernetes cluster integration
  • PyTorch or TensorFlow frameworks

Nice-to-have

  • Cloud-native technology expertise
  • Passion for solving challenging problems
  • Experience with experiment tracking frameworks

Key Requirements

  • Senior level experience
  • Proficiency in deep learning frameworks

Work Rights

Not specified

Tailored Resume

Cover Letter