The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content
Job Summary
The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content.
Candidates will lead the optimization of AI/ML inference workflows to ensure low latency, high throughput, and cost efficiency across real-time and batch processing paths.
This position offers the opportunity to work with a global talent powerhouse in the fintech space, collaborating on cutting-edge blockchain and digital finance solutions.
Matching Summary
The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content.
Skills & Requirements
Must-have
5-7+ years backend engineering experience
3+ years AI/ML inference integration
Python or Node.js expertise
Distributed system architecture design
Vector search and semantic retrieval
AWS/GCP cloud infrastructure deployment
Production-grade moderation pipeline development
Nice-to-have
Experience with vLLM or Triton model serving
Knowledge of PyTorch and TensorFlow frameworks
Mentoring junior engineers
Optimization of GPU utilization
Multimodal model benchmarking skills
Event-driven workflow patterns
Key Requirements
Bachelor's degree in Computer Science or equivalent
5-7+ years of backend engineering experience
3+ years integrating AI/ML systems into production