Job Summary
The role involves architecting and operating backend services for a highly scalable media intelligence platform that processes large volumes of multimedia content.
Candidates will lead the optimization of AI/ML inference workflows to ensure low latency and high throughput across real-time and batch-processing paths.
This position requires deep expertise in distributed systems, event-driven architectures, and integrating multimodal AI models into production-grade moderation pipelines.
Skills & Requirements
Must-have
5-7+ years backend engineering experience
3+ years AI/ML integration in production
Python or Node.js expertise
Distributed system architecture design
Vector search and semantic retrieval
Media processing pipeline development
Nice-to-have
Experience with vLLM or Triton model serving
Knowledge of PyTorch and TensorFlow frameworks
Background in video/audio analysis
Mentoring junior engineers
Experience with cloud platforms (AWS, GCP, or Azure)
Key Requirements
Bachelor's degree in Computer Science or equivalent
5-7+ years of backend engineering experience
3+ years of AI/ML inference integration experience