Research Scientist - Speech & Audio Understanding (speech Generation)

Tencent Cloud

Bellevue, Washington, US
Base: $122,500.00 to $229,700.00 py; bonus/equity:...
Speech generation algorithms
Voice foundation models
Deep learning frameworks
TEG provides users with a full range of customer services and leads infrastructure R&D through open source collaboration

Job Summary

  • TEG provides users with a full range of customer services and leads infrastructure R&D through open source collaboration.
  • The role involves tracking the latest research in speech generation, exploring next-generation paradigms, and pushing the boundaries of speech generation capabilities.
  • Employees are eligible for sign on payments, relocation packages, restricted stock units, and comprehensive benefits including medical, dental, vision, life and disability insurance, 401(k) plan, and paid time off.

Matching Summary

TEG provides users with a full range of customer services and leads infrastructure R&D through open source collaboration.

Salary

Base: $122,500.00 to $229,700.00 per year; Bonus/Equity: sign on payment and restricted stock units possible; Benefits: medical, dental, vision, life and disability insurance, 401(k), paid vacation, holidays, and sick leave

Skills & Requirements

Must-have

  • speech generation algorithms
  • voice foundation models
  • deep learning frameworks
  • large-scale model training
  • multimodal voice technologies

Nice-to-have

  • large model architecture understanding
  • innovative application development
  • distributed open source collaboration

Key Requirements

  • Master’s or Ph.D. in related fields
  • experience in speech synthesis or recognition
  • familiarity with voice-enabled large models
  • proficiency in PyTorch
  • experience with large-scale pretraining or post-training

Work Rights

Not specified

Tailored Resume

Cover Letter