Job Summary
The Senior Site Reliability Engineer will safeguard operational excellence for mission-critical platforms under Stratos by defining and tracking SLIs, SLOs, and Error Budgets.
This role requires close collaboration with Development and Production Support teams to drive service reliability, availability, and scalability while minimizing toil through automation.
Candidates must possess deep technical expertise in Azure services, C#, and monitoring tools to handle production incidents and implement continuous improvements.
Skills & Requirements
Must-have
Azure resources expertise including AKS and App Services
Azure DevOps and GitOps pipeline development
Azure PowerShell scripting and infrastructure automation
Proficiency with monitoring tools: Grafana, Dynatrace, and Splunk
C# programming language proficiency
Production incident response and root cause analysis
Nice-to-have
Experience with Terraform, Docker, and Kubernetes
Knowledge of Navitaire cloud implementation
Ability to adapt to emerging cloud technologies
Strong problem-solving and analytical skills
Effective collaboration in multi-cultural environments
Key Requirements
Hands-on experience with Azure Storage, Networking, Functions, and Logic Apps
Proficiency in developing Azure Runbooks and infrastructure automation
Proven ability to work in a dynamic, fast-paced environment
Experience with CI/CD pipelines using Azure DevOps