Workload Optimization Intern

Intel Corporation

Shanghai, China
Onsite
C++ and Python programming proficiency
Deep learning fundamentals knowledge
Master's or Ph.D. student status
Intel Corporation is seeking a Graduate Technical Intern for their AI Engineering team in Shanghai, focusing on workload optimization through deep learning solutions. The role requires strong programming skills in C++ and Python, as well as a solid foundation in deep learning and model architecture.

Job Summary

  • The role focuses on optimizing key use cases and models while debugging issues related to accuracy and memory management.
  • Candidates will design and develop model deployment frameworks leveraging new features in vLLM to accelerate inference.
  • The position requires a commitment of at least 4 days per week for a duration of 6 months or longer.

Matching Summary

Match Score: 85

Skills & Requirements

Must-have

  • C++ and Python programming proficiency
  • Deep Learning fundamentals knowledge
  • Master's or Ph.D. student status

Nice-to-have

  • Experience with LLMs and multimodal models
  • Familiarity with PyTorch and vLLM frameworks
  • Hands-on GPU kernel development experience

Key Requirements

  • Current Master's or Ph.D. student in CS, AI, or Software Engineering
  • Minimum 4 days per week availability
  • 6 months or longer commitment required

Work Rights

Not specified
