Workload Optimization Intern

Intel

Shanghai, China
On-site
C++ and python programming proficiency
Deep learning fundamentals understanding
Gpu kernel development experience
Intel is seeking a Graduate Technical Intern for its AI Engineering team in Shanghai, focusing on workload optimization and deep learning solutions. The role involves performance optimization, deployment architecture design, and kernel development, requiring strong programming skills and a background in AI

Job Summary

  • Intel is seeking a Graduate Technical Intern to help deliver high-performance deep learning solutions as part of its expanding AI Engineering team.
  • The role involves optimizing key use cases, debugging accuracy issues, and designing model deployment frameworks leveraging new features in vLLM.
  • Candidates must be available for a minimum of 4 days per week with a commitment of 6 months or longer at the Shanghai location.

Matching Summary

Match Score: 85

Intel is seeking a Graduate Technical Intern for its AI Engineering team in Shanghai, focusing on workload optimization and deep learning solutions. The role involves performance optimization, deployment architecture design, and kernel development, requiring strong programming skills and a background in AI.

Skills & Requirements

Must-have

  • C++ and Python programming proficiency
  • Deep learning fundamentals understanding
  • GPU Kernel development experience

Nice-to-have

  • Experience with LLMs and multimodal models
  • Familiarity with vLLM inference frameworks
  • PyTorch framework expertise

Key Requirements

  • Current Master's or Ph.D. student status
  • Minimum 4 days per week availability
  • 6 months or longer commitment duration

Work Rights

Not specified

Tailored Resume

Cover Letter