XPENG Inc is seeking a Senior Staff AI Data Infrastructure Engineer in Santa Clara, CA, to join their AI Infrastructure team. The role involves building scalable data pipelines and optimizing data processing for autonomous vehicle systems
Job Summary
You will architect and build scalable end-to-end pipelines to automate the ingestion and processing of PB-scale raw data for production autonomy and multi-modal LLMs.
The role involves evolving data storage solutions based on Apache Iceberg and Lance to implement efficient semantic indexing and metadata management.
Your work directly determines how self-driving systems learn from massive datasets by optimizing data loading strategies for large-scale training on 10,000+ GPU clusters.
Matching Summary
Match Score: 85
XPENG Inc is seeking a Senior Staff AI Data Infrastructure Engineer in Santa Clara, CA, to join their AI Infrastructure team. The role involves building scalable data pipelines and optimizing data processing for autonomous vehicle systems.
Skills & Requirements
Must-have
PB-scale raw data processing
Apache Iceberg and Lance architecture
Python C++ or Java programming
Ray and Spark distributed frameworks
10,000+ GPU cluster optimization
Nice-to-have
High-performance concurrent programming
Semantic indexing expertise
Multi-modal model experience
Autonomous driving domain knowledge
Key Requirements
7+ years of industry experience
BS/MS/PhD in Computer Science
Proven track record building large-scale distributed systems