Centific Global is seeking a PhD Research Intern to work on speech AI models, specifically focusing on spoken language models that interact conversationally. This remote internship offers competitive stipends, mentorship, and the opportunity to work on impactful projects within a collaborative environment
Job Summary
Centific AI Research seeks a PhD Research Intern to design and evaluate speech-first models, with a focus on Spoken Language Models (SLMs) that reason over audio and interact conversationally.
The role involves end-to-end speech dialogue systems, alignment between speech encoders and text backbones, and efficient speech tokenization for long-form audio.
Interns will work with scientists and engineers to deliver measurable impact, with opportunities for publication and presentation.
Matching Summary
Match Score: 85
Centific Global is seeking a PhD Research Intern to work on speech AI models, specifically focusing on spoken language models that interact conversationally. This remote internship offers competitive stipends, mentorship, and the opportunity to work on impactful projects within a collaborative environment.
Salary
$35-$45 Hourly
Skills & Requirements
Must-have
Speech dialogue systems
Speech-aware LLMs
Efficient speech tokenization
Latency-aware inference
Python and PyTorch fluency
Modern sequence models
Nice-to-have
Speech generation experience
Multilingual speech background
Safety and bias evaluation
Distributed training experience
Key Requirements
PhD candidate in CS/EE or related field
Research in speech, audio ML, or multimodal LMs
Hands-on GPU training experience
Familiarity with torchaudio or librosa
Depth in discrete speech tokens/temporal compression, modality alignment, or post-training for speech tasks