AI Intern – VLA Deployment

XPENG Inc.

Santa Clara, CA, United States
On-site

Job Summary

  • The role focuses on optimizing and deploying Vision-Language-Action models onto vehicle-grade compute platforms for real-time autonomous driving.
  • Candidates will support model quantization, pruning, and compression techniques under the guidance of senior engineers.
  • The position requires collaboration with cross-functional teams to ensure stable deployment in both vehicle and simulation environments.

Skills & Requirements

Must-have

  • Strong C++ and Python programming skills
  • Familiarity with PyTorch deep learning framework
  • Understanding of model inference and deployment workflows
  • Knowledge of ONNX or TensorRT frameworks
  • Exposure to INT8 or FP16 quantization concepts

Nice-to-have

  • Experience with CUDA or GPU programming
  • Background in Transformers or multimodal models
  • Interest in computer architecture and edge systems
  • Previous internship in embedded AI or inference acceleration
  • Contributions to open-source repositories or research projects

Key Requirements

  • BS, MS, or PhD in Computer Science, Electrical Engineering, Robotics, or related field
  • Strong problem-solving skills in a fast-paced engineering environment
  • Good communication skills for cross-functional collaboration

Work Rights

Not specified
