Manager Site Reliability Engineer

Workday

Hybrid (at least 50% of time in-office or in the field per quarter)
3+ years leading sre or database engineering teams
8+ years software or systems engineering experience
Database internals tuning and replication topologies
Workday is seeking a Manager Site Reliability Engineer to lead their Database Reliability Engineering team, focusing on high-performance, scalable data infrastructure. The ideal candidate should possess extensive experience in database management and engineering, with a strong emphasis on automation and reliability

Job Summary

  • The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
  • Candidates must have extensive experience managing thousands of production databases across multiple data centers and public clouds.
  • Workday offers a flexible work approach requiring at least 50% time in-office or field per quarter to deepen team connections.

Matching Summary

Match Score: 85

Workday is seeking a Manager Site Reliability Engineer to lead their Database Reliability Engineering team, focusing on high-performance, scalable data infrastructure. The ideal candidate should possess extensive experience in database management and engineering, with a strong emphasis on automation and reliability.

Skills & Requirements

Must-have

  • 3+ years leading SRE or Database Engineering teams
  • 8+ years software or systems engineering experience
  • Database internals tuning and replication topologies
  • Kubernetes Operators and stateful sets management
  • AWS RDS/Aurora and GCP Cloud SQL experience
  • Observability stacks like Prometheus and Grafana
  • Automated failover mechanisms implementation

Nice-to-have

  • Passion for Open-Source and Cloud Native solutions
  • Fostering culture of psychological safety
  • Experience with Agile/Scrum and CIP processes
  • Deep-dive troubleshooting in Linux internals
  • Mentoring senior engineers effectively

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • 5+ years spearheading high-stakes response for critical outages
  • Proven ability to reduce Mean Time to Resolution (MTTR)

Work Rights

Not specified

Tailored Resume

Cover Letter