Oversee 24x7 monitoring of all infrastructure and applications using SolarWinds, Splunk, and BigPanda, ensuring timely detection, triage, and escalation of events
Job Summary
Oversee 24x7 monitoring of all infrastructure and applications using SolarWinds, Splunk, and BigPanda, ensuring timely detection, triage, and escalation of events.
Lead the end‑to‑end incident management process, driving rapid response, coordinated resolution, and adherence to SLAs.
Manage shift operations and ensure effective 24x7 coverage while driving capability development and cross-skilling across the NOC team.
Matching Summary
Oversee 24x7 monitoring of all infrastructure and applications using SolarWinds, Splunk, and BigPanda, ensuring timely detection, triage, and escalation of events.
Skills & Requirements
Must-have
24x7 infrastructure monitoring
Incident management process
Monitoring standards optimization
Runbook development and maintenance
Trend analysis and RCAs
Capacity and performance monitoring
Change management collaboration
Nice-to-have
Infectious enthusiasm for service delivery
Ability to act with influence
Professionalism and courtesy
Continuous process improvement
Employee well-being focus
Key Requirements
4+ years leadership experience in global tech service delivery