Manager Site Reliability Engineer

Workday

Workday is seeking a Manager Site Reliability Engineer to lead its Database Reliability Engineering team, focusing on the resiliency, security, and scalability of its data infrastructure. The ideal candidate will have extensive experience in managing large-scale database environments and a passion for automated, self-healing systems.

Job Summary

  • The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
  • Candidates must possess deep technical expertise in database internals, including engine tuning, replication topologies, and query optimization.
  • Workday offers a flexible work approach requiring at least half of the time each quarter to be spent in-office or with customers.

Matching Summary

Match Score: 75

Skills & Requirements

Must-have

  • 3+ years leading SRE or Database Engineering teams
  • 8+ years software systems engineering experience
  • Database internals, including tuning and replication topologies
  • Kubernetes Operators and StatefulSets management
  • AWS RDS/Aurora and GCP Cloud SQL experience
  • Prometheus, Grafana, and Datadog observability stacks
  • Automated failover mechanisms implementation

Nice-to-have

  • Passion for Open-Source and Cloud Native solutions
  • Culture of psychological safety and high performance
  • Experience reducing Mean Time to Resolution (MTTR)
  • Deep-dive troubleshooting in Linux internals
  • Agile, Scrum, and Continual Improvement Process knowledge

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • 5+ years spearheading critical data outage response
  • Proven ability to mentor senior engineers
  • Experience managing distributed system latency issues

Work Rights

Not specified
