Intel Corporation is seeking a Graduate Technical Intern for their AI Engineering team in Shanghai, focusing on workload optimization through deep learning solutions. The role requires strong programming skills in C++ and Python, as well as a solid foundation in deep learning and model architecture
Job Summary
The role focuses on optimizing key use cases and models while debugging issues related to accuracy and memory management.
Candidates will design and develop model deployment frameworks leveraging new features in vLLM to accelerate inference.
The position requires a commitment of at least 4 days per week for a duration of 6 months or longer.
Matching Summary
Match Score: 85
Intel Corporation is seeking a Graduate Technical Intern for their AI Engineering team in Shanghai, focusing on workload optimization through deep learning solutions. The role requires strong programming skills in C++ and Python, as well as a solid foundation in deep learning and model architecture.
Skills & Requirements
Must-have
C++ and Python programming proficiency
Deep Learning fundamentals knowledge
Master's or Ph.D. student status
Nice-to-have
Experience with LLMs and Multimodal models
Familiarity with PyTorch and vLLM frameworks
Hands-on GPU Kernel development experience
Key Requirements
Current Master's or Ph.D. student in CS, AI, or Software Engineering