Hyphen Connect is looking for a Multimodal AI Systems Architect to develop and optimize AI systems that integrate vision and audio models, enhancing voice-to-voice interactions and multimodal retrieval capabilities. The ideal candidate will have experience with specific AI technologies and streaming architectures
Job Summary
The role focuses on developing AI systems that seamlessly integrate vision and audio models to enhance voice-to-voice interactions.
Candidates will be responsible for architecting multimodal RAG systems capable of retrieving insights from videos and PDFs.
This position requires optimizing streaming latency to ensure efficient and innovative AI performance.
Matching Summary
Match Score: 85
Hyphen Connect is looking for a Multimodal AI Systems Architect to develop and optimize AI systems that integrate vision and audio models, enhancing voice-to-voice interactions and multimodal retrieval capabilities. The ideal candidate will have experience with specific AI technologies and streaming architectures.