3+ years leading sre or database engineering teams
8+ years software or systems engineering experience
Database internals tuning and query optimization
Workday is seeking a Manager Site Reliability Engineer to lead their Database Reliability Engineering team, focusing on architecting resilient data infrastructure while fostering a collaborative and innovative work culture. The ideal candidate will have extensive experience in database technologies and team leadership, with a strong emphasis on automation and system reliability
Job Summary
The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
Workday is a Fortune 500 company offering a culture rooted in integrity, empathy, and shared enthusiasm for tackling big challenges.
Candidates must have extensive experience managing thousands of production databases across multiple data centers, public clouds, and geographies.
Matching Summary
Match Score: 85
Workday is seeking a Manager Site Reliability Engineer to lead their Database Reliability Engineering team, focusing on architecting resilient data infrastructure while fostering a collaborative and innovative work culture. The ideal candidate will have extensive experience in database technologies and team leadership, with a strong emphasis on automation and system reliability.
Skills & Requirements
Must-have
3+ years leading SRE or Database Engineering teams
8+ years software or systems engineering experience
Database internals tuning and query optimization
Kubernetes Operators and stateful sets management
AWS RDS/Aurora and GCP Cloud SQL experience
Prometheus, Grafana, Datadog observability stacks
Nice-to-have
Passion for Open-Source and Cloud Native solutions
Culture of psychological safety and high performance
Experience reducing Mean Time to Resolution (MTTR)
Deep-dive troubleshooting in Linux internals and networking
Key Requirements
Bachelor's degree in Computer Science or related field
4+ years as an SRE/DBRE designing resilient data infrastructure
5+ years spearheading response for critical data outages