This role is responsible for ensuring the reliability, scalability, and operational excellence of large-scale production-grade platforms within Evernorth's innovation hub
Job Summary
This role is responsible for ensuring the reliability, scalability, and operational excellence of large-scale production-grade platforms within Evernorth's innovation hub.
The engineer will lead incident response activities, drive root cause analysis, and implement preventative automation to enhance system observability.
Candidates must possess strong hands-on experience supporting large-scale systems with proven ability to lead incidents and make sound technical decisions.
Matching Summary
This role is responsible for ensuring the reliability, scalability, and operational excellence of large-scale production-grade platforms within Evernorth's innovation hub.
Skills & Requirements
Must-have
Advanced Kubernetes cluster operations
Infrastructure as Code with Terraform
Cloud platform proficiency AWS Azure
Expert Linux administration and troubleshooting
Python Bash Go scripting skills
CI/CD pipeline implementation experience
Nice-to-have
Hybrid or multi-cloud environment experience
Kubernetes certification credentials
Advanced networking security concepts
DevOps SRE best practices mentorship
Key Requirements
5-9 years in SRE DevOps or production support
Bachelor's degree in Computer Science or related field