Job Summary
The Lead Site Reliability Engineer will safeguard operational excellence for mission-critical platforms under Stratos by defining SLOs and driving reliability strategies.
This role requires deep expertise in Azure architecture to optimize costs, reduce manual toil, and implement infrastructure-as-code practices using Terraform and Kubernetes.
Collaboration with Development and Production Support teams is essential to design code that meets target SLOs and to lead production incident response.
Skills & Requirements
Must-have
Expertise in Azure resources, including Storage, Networking, and Functions
Azure DevOps and GitOps pipelines, including build and release management
Azure PowerShell scripting and infrastructure automation tools
Monitoring and logging tools (Grafana, Dynatrace, Splunk)