The role involves architecting and operating scalable backend services for a media intelligence platform that processes large volumes of multimedia content
Job Summary
The role involves architecting and operating scalable backend services for a media intelligence platform that processes large volumes of multimedia content.
Candidates will lead the development of production-grade moderation pipelines using open-source models and optimize AI/ML inference for latency and cost.
Tether offers a global remote work environment where team members collaborate on pioneering financial revolution solutions.
Matching Summary
The role involves architecting and operating scalable backend services for a media intelligence platform that processes large volumes of multimedia content.
Skills & Requirements
Must-have
5-7+ years backend engineering experience
3+ years AI/ML inference integration
Python or Node.js expertise
Distributed system architecture design
Vector search and semantic retrieval
AWS/GCP cloud deployment
Production model optimization
Nice-to-have
Experience with vLLM or Triton serving
Mentoring junior engineers
Multimodal model integration
Event-driven workflow patterns
Knowledge of HuggingFace ecosystem
Key Requirements
Bachelor's degree in Computer Science or equivalent
Strong English communication skills
Proven track record owning backend modules end-to-end