Own the stability, scalability, and performance of production-grade ML platforms by designing and enhancing backend services, orchestration layers, and system integrations that power critical ML workflows
Job Summary
Own the stability, scalability, and performance of production-grade ML platforms by designing and enhancing backend services, orchestration layers, and system integrations that power critical ML workflows.
Ensure resilient system architectures in distributed settings to preserve high availability, tolerate faults, and enable smooth operation.
Act as a senior technical anchor for cross-functional teams and external vendors, translating complex ML and business requirements into resilient, scalable backend solutions.
Matching Summary
Own the stability, scalability, and performance of production-grade ML platforms by designing and enhancing backend services, orchestration layers, and system integrations that power critical ML workflows.
Skills & Requirements
Must-have
ML platform stability and scalability
backend services for ML workflows
resilient system architectures
Docker and Kubernetes (AKS)
CI/CD workflows
Python backend development
Nice-to-have
methodical problem-solving approach
willingness to remain competitive
advancing innovation and efficiency
Key Requirements
3-5 years of experience
Backend Engineer, Platform Engineer, Senior ML Engineer