This role focuses on designing and implementing resilient Business Continuity Planning and Disaster Recovery capabilities across AWS environments to protect mission-critical operations
Job Summary
This role focuses on designing and implementing resilient Business Continuity Planning and Disaster Recovery capabilities across AWS environments to protect mission-critical operations.
The successful candidate will engineer multi-region architectures using Kubernetes and Terraform while automating failover, backup, and recovery testing procedures.
You will collaborate closely with security and infrastructure teams to validate resiliency through chaos engineering practices and regular tabletop exercises.
Matching Summary
This role focuses on designing and implementing resilient Business Continuity Planning and Disaster Recovery capabilities across AWS environments to protect mission-critical operations.
Skills & Requirements
Must-have
5+ years cloud infrastructure experience
3+ years hands-on AWS production environment
Multi-region DR architecture design
Terraform Infrastructure as Code proficiency
Python or Bash scripting for automation
Kubernetes container orchestration knowledge
Active-active and active-passive failover patterns
Nice-to-have
AWS Certified Solutions Architect Professional
Experience with AWS Resilience Hub
Knowledge of EKS multi-region DR patterns
Familiarity with CloudEndure or DRS tools
Serverless DR pattern implementation
Strong executive-level communication skills
Key Requirements
5+ years in cloud infrastructure or SRE roles
3+ years hands-on AWS experience at scale
Expert proficiency with core AWS resilience services
Proven delivery of tested multi-region DR architectures
Strong scripting skills in Python, Bash, or PowerShell