Research Internship- Multimodal Llm (speech/music/audio/vision/language)
Tencent Music Entertainment Group
Bellevue, Washington, US
Base: $80,168.40 to $124,800.00 py; bonus/equity: ...
**
Ph.d. student in computer science or related field
Proficiency in python and c++ programming
Experience with deep learning toolkits
**
Tencent Music Entertainment Group is seeking a research intern for its AI Lab in Bellevue, Washington, to work on multimodal large language models focusing on speech, music, audio, vision, and language processing. The ideal candidate is a Ph.D. student with relevant research experience and programming skills, who is passionate about developing innovative AI techniques.
**
Job Summary
The role involves working with researchers to attack core problems by inventing cutting-edge techniques in multimodal AI.
Interns are encouraged to publish their results from the internship at top conferences and journals.
The position offers eligibility for paid sick leave, 13 paid holidays, and enrollment in a company-sponsored medical plan.
Matching Summary
Match Score: 75
**
Tencent Music Entertainment Group is seeking a research intern for its AI Lab in Bellevue, Washington, to work on multimodal large language models focusing on speech, music, audio, vision, and language processing. The ideal candidate is a Ph.D. student with relevant research experience and programming skills, who is passionate about developing innovative AI techniques.
**
Salary
Base: $80,168.40 to $124,800.00 per year; Bonus/Equity: Not specified; Benefits: Paid sick leave, 13 paid holidays, medical plan eligibility
Skills & Requirements
Must-have
Ph.D. student in computer science or related field
Proficiency in Python and C++ programming
Experience with deep learning toolkits
Background in NLP, speech, audio, or computer vision