The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content
Job Summary
The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content.
Candidates will lead the development of AI/ML inference workflows, optimizing for latency, throughput, and cost using techniques like quantization and batching.
This position offers the opportunity to collaborate with a global talent powerhouse in the fintech space, focusing on secure and transparent digital finance solutions.
Matching Summary
The role involves architecting and operating scalable backend services for a global media intelligence platform that processes text, image, video, and audio content.
Skills & Requirements
Must-have
5-7+ years backend engineering experience
3+ years AI/ML integration in production
Python or Node.js expertise
Distributed system architecture design
Vector search and semantic retrieval
AWS/GCP cloud infrastructure deployment
Nice-to-have
Experience with vLLM or Triton model serving
Knowledge of HuggingFace transformers ecosystem
Background in video/audio processing pipelines
Mentoring junior engineers
Experience with PyTorch and TensorFlow
Key Requirements
Bachelor's degree in Computer Science or equivalent