Manager Site Reliability Engineer

Workday

Hybrid (at least 50% in-office time)
3+ years leading sre or database engineering teams
8+ years in software or systems engineering
Database internals tuning and replication topologies
Workday is seeking a Manager of Site Reliability Engineering to lead their Database Reliability Engineering team. The ideal candidate will possess extensive experience in software and systems engineering, particularly in managing large-scale database environments, and will focus on building resilient, automated data infrastructure

Job Summary

  • The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
  • Candidates must possess deep expertise in database internals, including engine tuning, replication topologies, and query optimization.
  • Workday offers a flexible work approach requiring at least 50% time in-office or with customers each quarter.

Matching Summary

Match Score: 85

Workday is seeking a Manager of Site Reliability Engineering to lead their Database Reliability Engineering team. The ideal candidate will possess extensive experience in software and systems engineering, particularly in managing large-scale database environments, and will focus on building resilient, automated data infrastructure.

Skills & Requirements

Must-have

  • 3+ years leading SRE or Database Engineering teams
  • 8+ years in software or systems engineering
  • Database internals tuning and replication topologies
  • Kubernetes Operators and stateful sets management
  • AWS RDS/Aurora and GCP Cloud SQL experience
  • Prometheus, Grafana, Datadog observability stacks

Nice-to-have

  • Passion for Open-Source and Cloud Native solutions
  • Culture of psychological safety and high performance
  • Experience reducing Mean Time to Resolution (MTTR)
  • Deep-dive troubleshooting of Linux internals and networking
  • Mentoring senior engineers and fostering technical excellence

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • 4+ years as an SRE/DBRE designing resilient data infrastructure
  • 5+ years spearheading response for critical data outages

Work Rights

Not specified

Tailored Resume

Cover Letter