This role ensures the reliability and operability of enterprise platforms across hybrid cloud and on-prem environments through engineering-driven operations
Job Summary
This role ensures the reliability and operability of enterprise platforms across hybrid cloud and on-prem environments through engineering-driven operations.
The successful candidate will lead L3 incident resolution, define SLOs/KPIs, and drive continuous improvement via automation and Infrastructure-as-Code.
The company offers comprehensive benefits including professional development programs, well-being initiatives, and a strong commitment to diversity and inclusion.
Matching Summary
This role ensures the reliability and operability of enterprise platforms across hybrid cloud and on-prem environments through engineering-driven operations.
Skills & Requirements
Must-have
5+ years platform SRE experience
Hybrid cloud and on-prem operations
Terraform and Ansible IaC expertise
Python scripting proficiency
L3 incident leadership and RCA
Nice-to-have
AIOps and ML anomaly detection
Azure platform knowledge
Container and DevOps exposure
Engineering mindset for toil reduction
Global matrixed environment collaboration
Key Requirements
5+ years in platform/SRE/operations roles
Strong infrastructure fundamentals in networking and virtualization