3+ years leading sre or database engineering teams
8+ years software systems engineering experience
Database internals tuning and query optimization
**
Workday is seeking a Manager of Site Reliability Engineering to lead their Database Reliability Engineering team, focusing on high-performance and scalable data infrastructure. The ideal candidate should have extensive experience in software or systems engineering, specifically in managing large-scale database environments with a strong emphasis on automation and resilience.
**
Job Summary
The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
Candidates must possess deep technical expertise in database internals, replication topologies, and managing databases within Kubernetes.
Workday offers a flexible work approach requiring at least 50% time in-office or field per quarter while supporting remote home office roles.
Matching Summary
Match Score: 75
**
Workday is seeking a Manager of Site Reliability Engineering to lead their Database Reliability Engineering team, focusing on high-performance and scalable data infrastructure. The ideal candidate should have extensive experience in software or systems engineering, specifically in managing large-scale database environments with a strong emphasis on automation and resilience.
**
Salary
Not specified; Not specified; Not specified
Skills & Requirements
Must-have
3+ years leading SRE or Database Engineering teams
8+ years software systems engineering experience
Database internals tuning and query optimization
Kubernetes Operators and stateful sets management
AWS RDS Aurora and GCP Cloud SQL experience
Prometheus Grafana Datadog observability stacks
Automated failover mechanisms implementation
Nice-to-have
Passion for Open-Source and Cloud Native solutions
Fostering culture of psychological safety
Deep-dive troubleshooting on Linux internals
Mentoring senior engineers effectively
Reducing toil through automation initiatives
Key Requirements
Bachelor's degree in Computer Science or related field
5+ years experience spearheading critical data outage response
Strong understanding of Agile/Scrum and Continual Improvement Process