The Cloud Persistence team delivers the mission-critical storage layer for AEM Cloud Service, ensuring content consistency, durability, and fast access at scale
Job Summary
The Cloud Persistence team delivers the mission-critical storage layer for AEM Cloud Service, ensuring content consistency, durability, and fast access at scale.
You will own the reliability, performance, and operational readiness of storage components, building and improving monitoring, alerting, and incident management processes.
This role involves partnering with engineers to design observable, scalable systems and automating repetitive operational work, operating at thousands-of-clusters scale.
Matching Summary
The Cloud Persistence team delivers the mission-critical storage layer for AEM Cloud Service, ensuring content consistency, durability, and fast access at scale.
Skills & Requirements
Must-have
reliability, performance, operational readiness
monitoring, alerting, dashboards, playbooks
SLIs/SLOs, error budgets
incident analysis, root-cause fixes
distributed systems debugging
Kubernetes and cloud experience
Nice-to-have
curiosity, willingness to learn
collaborative mindset
Java/JVM familiarity
Key Requirements
6+ years SRE/production engineering experience
Bachelor's or Master's degree in Computer Science or equivalent experience
Strong experience with observability stacks
Solid understanding of SLIs/SLOs, incident management