Workload Optimization Intern

INTEL

Shanghai, China
On-site
C++ and python programming proficiency
Deep learning fundamentals knowledge
Master's or ph.d. student status
Intel is seeking a Graduate Technical Intern for its AI Engineering team in Shanghai, focusing on performance optimization and deep learning solutions. Ideal candidates should have a strong background in computer science or related fields, with proficiency in C++ and Python, and a solid understanding of deep learning fundamentals

Job Summary

  • The role focuses on optimizing key use cases and models while debugging issues related to accuracy and memory management.
  • Candidates will design and develop model deployment frameworks leveraging new features in vLLM to accelerate inference.
  • The position requires a commitment of at least four days per week for a duration of six months or longer.

Matching Summary

Match Score: 85

Intel is seeking a Graduate Technical Intern for its AI Engineering team in Shanghai, focusing on performance optimization and deep learning solutions. Ideal candidates should have a strong background in computer science or related fields, with proficiency in C++ and Python, and a solid understanding of deep learning fundamentals.

Skills & Requirements

Must-have

  • C++ and Python programming proficiency
  • Deep Learning fundamentals knowledge
  • Master's or Ph.D. student status

Nice-to-have

  • Experience with LLMs and multimodal models
  • Familiarity with PyTorch and vLLM frameworks
  • Hands-on GPU Kernel development experience

Key Requirements

  • Current Master's or Ph.D. student in CS or AI
  • Minimum 4 days per week availability
  • 6 months or longer commitment required

Work Rights

Not specified

Tailored Resume

Cover Letter