Cloud, linux/unix/windows and services/apis, databases
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization
Job Summary
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization.
Perform production support role and partner with the SRE Delivery team in incident management and problem management.
We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper.
Matching Summary
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization.
Skills & Requirements
Must-have
Site Reliability Engineering (SRE) best practices
monitoring, alerting, and incident management
Cloud, Linux/Unix/Windows and services/APIs, databases
scripting in Java/.NET and SQL
major incident handling and communication
SRE languages and tools (Ansible, Dynatrace, ServiceNow, GitHub, Slack, ELK stack)
Nice-to-have
progressive thinking and continuous improvement
collaboration with IT partners
knowledge of KAFKA, OCP, SCON infrastructure
cloud platform applications and processes
Key Requirements
5+ years of working experience in Site Reliability Engineering (SRE)
Intermediate experience in a variety of environments