Research Scientist - Speech & Audio Understanding (Speech Generation)

Tencent Music Entertainment Group

Bellevue, Washington, US
Tencent Music Entertainment Group is seeking a Research Scientist specializing in Speech & Audio Understanding, focusing on advancing speech generation and multimodal voice technologies. The role requires a strong background in AI, computer science, or related fields, with expertise in voice foundation models and deep learning frameworks.

Job Summary

  • The role involves tracking the latest research in speech generation algorithms and exploring next-generation paradigms for audio generation.
  • Candidates will lead technical R&D of voice foundation models to enhance voice interaction experiences by integrating text, speech, and vision.
  • Employees are eligible for a sign-on payment, relocation package, restricted stock units, and up to 25 days of vacation per year.

Matching Summary

Match Score: 75

Salary

  • Base: $122,500.00 to $229,700.00 per year
  • Bonus/Equity: Sign-on payment and restricted stock units available
  • Benefits: Medical, dental, vision, life, disability, 401(k), and paid leave included

Skills & Requirements

Must-have

  • Master's or Ph.D. in Computer Science
  • Experience with voice foundation models
  • Proficiency in PyTorch deep learning framework

Nice-to-have

  • Experience with Megatron/DeepSpeed frameworks
  • Familiarity with GPT-4o or GLM-4-Voice models
  • Prior project experience in audio generation

Key Requirements

  • Master's or Ph.D. degree required
  • Background in Signal Processing or Electronic Engineering
  • Solid understanding of large model architectures

Work Rights

Not specified
