Job Summary
The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content.
Candidates will lead the optimization of AI/ML inference workflows to ensure low latency and high throughput while managing costs across distributed cloud environments.
Tether offers a remote-first culture where engineers collaborate globally to pioneer innovations in digital finance, sustainable energy, and secure data sharing.
Skills & Requirements
Must-have
5-7+ years backend engineering experience
3+ years AI/ML inference integration
Python or Node.js expertise
Distributed system architecture design
Vector search and semantic retrieval
Media processing pipeline development
Nice-to-have
Experience with vLLM or Triton model serving
Knowledge of quantization and model distillation
Background in video/audio analysis
Mentoring junior engineers
Familiarity with HuggingFace ecosystem
Key Requirements
Bachelor's degree in Computer Science or equivalent
5-7+ years of backend engineering experience
3+ years integrating AI/ML into production workflows