This role involves acting as the first responder for production incidents to ensure quick triage and resolution of critical issues within a cloud-based SaaS platform
Job Summary
This role involves acting as the first responder for production incidents to ensure quick triage and resolution of critical issues within a cloud-based SaaS platform.
The successful candidate will collaborate with development and architecture teams to improve observability, logging, and alerting across the application stack in a true DevOps manner.
Candidates must be willing to participate in on-call rotation and handle production support while mentoring junior engineers and working with global stakeholders.
Matching Summary
This role involves acting as the first responder for production incidents to ensure quick triage and resolution of critical issues within a cloud-based SaaS platform.
Skills & Requirements
Must-have
AWS Cloud Native application experience
CI/CD tools proficiency Azure DevOps
Scripting languages Python Bash
Relational database PostgreSQL SQL Server
Production incident response and triage
Nice-to-have
Telecom billing domain knowledge
Mentoring junior engineers
Global team collaboration skills
Agile ceremony participation
Key Requirements
Bachelor's degree in Computer Science or related field
Minimum 3 years managing enterprise-grade applications