Senior Staff AI Data Infrastructure Engineer

XPENG Inc

Santa Clara, CA, United States
On-site
XPENG Inc is seeking a Senior Staff AI Data Infrastructure Engineer in Santa Clara, CA, to join its AI Infrastructure team. The role involves building scalable data pipelines and optimizing data processing for autonomous vehicle systems.

Job Summary

  • You will architect and build scalable end-to-end pipelines to automate the ingestion and processing of PB-scale raw data for production autonomy and multi-modal LLMs.
  • The role involves evolving data storage solutions based on Apache Iceberg and Lance to implement efficient semantic indexing and metadata management.
  • Your work directly shapes how self-driving systems learn from massive datasets: you will optimize data loading strategies for large-scale training on clusters of 10,000+ GPUs.

Skills & Requirements

Must-have

  • PB-scale raw data processing
  • Apache Iceberg and Lance architecture
  • Python, C++, or Java programming
  • Ray and Spark distributed frameworks
  • 10,000+ GPU cluster optimization

Nice-to-have

  • High-performance concurrent programming
  • Semantic indexing expertise
  • Multi-modal model experience
  • Autonomous driving domain knowledge

Key Requirements

  • 7+ years of industry experience
  • BS/MS/PhD in Computer Science
  • Proven track record building large-scale distributed systems

Work Rights

Not specified
