Speech Algorithm Engineer

Tencent Music Entertainment Group

Not specified
Python or c++ programming proficiency
Pytorch megatron or deepspeed frameworks
Speech dialogue synthesis recognition experience
Tencent Music Entertainment Group is seeking a Speech Algorithm Engineer to focus on developing and optimizing large speech/audio models. The ideal candidate will have experience in speech dialogue, synthesis, and recognition, alongside strong programming skills

Job Summary

  • The role involves researching and developing speech and audio large models for dialogue, understanding, and generation tasks.
  • Candidates must possess strong coding skills in Python or C++ along with familiarity with model training frameworks like PyTorch.
  • The position requires overseeing the open-sourcing of models and optimizing end-to-end pipelines for various audio applications.

Matching Summary

Match Score: 85

Tencent Music Entertainment Group is seeking a Speech Algorithm Engineer to focus on developing and optimizing large speech/audio models. The ideal candidate will have experience in speech dialogue, synthesis, and recognition, alongside strong programming skills.

Skills & Requirements

Must-have

  • Python or C++ programming proficiency
  • PyTorch Megatron or DeepSpeed frameworks
  • Speech dialogue synthesis recognition experience
  • Data structures and algorithms foundation

Nice-to-have

  • ACM ICPC NOI IOI Top Coder Kaggle awards
  • Publications in NeurIPS ICLR ICML ACL CVPR
  • Strong motivation curiosity and teamwork spirit
  • Experience with audio-video multimodality

Key Requirements

  • Prior experience in speech dialogue or large language models
  • Solid background in mathematics and signal processing
  • Good reading ability for English technical literature

Work Rights

Not specified

Tailored Resume

Cover Letter