This role focuses on designing and maintaining the infrastructure that powers enterprise-scale generative AI applications
Job Summary
This role focuses on designing and maintaining the infrastructure that powers enterprise-scale generative AI applications.
The successful candidate will build scalable backend services using microservices architecture and cloud-native patterns to support AI model deployment.
Collaboration with AI engineers and product stakeholders is essential to translate AI capabilities into production-ready, high-availability systems.
Matching Summary
This role focuses on designing and maintaining the infrastructure that powers enterprise-scale generative AI applications.
Skills & Requirements
Must-have
Python backend development
Kubernetes container orchestration
AWS cloud infrastructure
AI model deployment and serving
Microservices architecture design
CI/CD pipeline implementation
Vector database integration
Nice-to-have
Go or Node.js familiarity
OpenShift experience
Serverless function optimization
Cost-effective inference strategies
Event-driven architecture knowledge
Key Requirements
4–6 years of backend engineering experience
2+ years of Kubernetes and cloud infrastructure experience
Bachelor's degree in Computer Science or related field
Proven experience with AI/ML model deployment in production