Responsible for managing the day-to-day operations, ensuring product administration, platform reliability, and overseeing incident management and resolution processes
Job Summary
Responsible for managing the day-to-day operations, ensuring product administration, platform reliability, and overseeing incident management and resolution processes.
Own incident resolution processes for L1 and L2 operations, ensuring timely and effective troubleshooting of technical issues.
Manage SaaS platform administrations and self-hosted applications in cloud environments like AWS.
Matching Summary
Responsible for managing the day-to-day operations, ensuring product administration, platform reliability, and overseeing incident management and resolution processes.
Skills & Requirements
Must-have
Docker, Kubernetes & Helm
incident management
service management
platform performance optimization
networking fundamentals
authentication/authorization mechanisms
Nice-to-have
AI-driven customer success
workforce transformation
driving innovation
operational best practices
Key Requirements
6+ years of experience in IT
Strong understanding of IT infrastructure
Strong experience on Docker, Kubernetes & Helm
Java preferred programming language experience
Proven experience with incident management
Expertise in monitoring tools
Strong understanding of networking fundamentals
Strong understanding of authentication/authorization mechanisms