2026 Summer Intern – AI/ML Intern – Vision Language Model/Action (Masters)

General Motors

San Francisco, California, US
Hybrid

Job Summary

  • Drive the development of embodied foundation models and vision-language-action architectures that unify multimodal perception with robotic control.
  • Prototype and refine ML models that leverage VLA architectures to improve decision-making and reasoning for autonomous vehicles through imitation and reinforcement learning.
  • Partner with perception, robotics, and systems engineering teams to integrate VLA research into the broader autonomous stack and validate models in closed-loop environments.

Salary

Base: $10,600 per month; Bonus/Equity: Not specified; Benefits: Paid US GM Holidays, GM Family First Vehicle Discount Program, Intern events

Skills & Requirements

Must-have

  • Vision-Language-Action architectures
  • Multimodal perception
  • Robotic control
  • Machine learning models
  • Python and ML frameworks

Nice-to-have

  • Autonomous vehicles experience
  • Large-scale datasets
  • High-performance computing
  • Foundation models for embodied control

Key Requirements

  • Master's in Machine Learning, AI, CS, or a related field
  • Solid understanding of modern ML
  • Proficiency in Python
  • Research experience in AI/ML
  • Strong problem-solving skills
  • Strong communication skills
  • Work full-time, 40 hours per week

Work Rights

Not specified
