The role involves leading a high-performing team of SREs to ensure systems run seamlessly at scale for thousands of travelers daily
Job Summary
The role involves leading a high-performing team of SREs to ensure systems run seamlessly at scale for thousands of travelers daily.
Candidates will drive innovation in infrastructure design by leveraging IaC tools and microservices architectures to automate operations.
The position requires managing 24x7 production support while defining service-level objectives to track system reliability using tools like NewRelic or DataDog.
Matching Summary
The role involves leading a high-performing team of SREs to ensure systems run seamlessly at scale for thousands of travelers daily.
Skills & Requirements
Must-have
8+ years Site Reliability Engineering experience
3+ years leadership position required
AWS cloud technologies expertise
Infrastructure as Code Terraform CloudFormation
Incident management and MTTR reduction
SLO definition and observability tools
Nice-to-have
Fostering culture of collaboration
Continuous improvement mindset
Cross-functional team communication skills
Microservices architecture design
Capacity planning and hiring experience
Key Requirements
8+ years in SRE, DevOps, or Infrastructure roles
At least 3 years in a leadership position
Hands-on experience with AWS ECS Lambda DynamoDB
Proficiency in Jenkins or GitHub Actions
Strong background in designing scalable fault-tolerant systems