Research Scientist - Speech & Audio Understanding (speech Generation)
Tencent Cloud
Bellevue, Washington, US
Base: $122,500.00 to $229,700.00 py; bonus/equity:...
Speech generation algorithms
Voice foundation models
Deep learning frameworks
TEG provides users with a full range of customer services and leads infrastructure R&D through open source collaboration
Job Summary
TEG provides users with a full range of customer services and leads infrastructure R&D through open source collaboration.
The role involves tracking the latest research in speech generation, exploring next-generation paradigms, and pushing the boundaries of speech generation capabilities.
Employees are eligible for sign on payments, relocation packages, restricted stock units, and comprehensive benefits including medical, dental, vision, life and disability insurance, 401(k) plan, and paid time off.
Matching Summary
TEG provides users with a full range of customer services and leads infrastructure R&D through open source collaboration.
Salary
Base: $122,500.00 to $229,700.00 per year; Bonus/Equity: sign on payment and restricted stock units possible; Benefits: medical, dental, vision, life and disability insurance, 401(k), paid vacation, holidays, and sick leave
Skills & Requirements
Must-have
speech generation algorithms
voice foundation models
deep learning frameworks
large-scale model training
multimodal voice technologies
Nice-to-have
large model architecture understanding
innovative application development
distributed open source collaboration
Key Requirements
Master’s or Ph.D. in related fields
experience in speech synthesis or recognition
familiarity with voice-enabled large models
proficiency in PyTorch
experience with large-scale pretraining or post-training