Manager Site Reliability Engineer

Workday

Not specified; not specified; not specified
Hybrid (50% in-office, flexible schedule)
3+ years leading sre or database engineering teams
8+ years software or systems engineering experience
Database internals tuning and replication topologies
Workday is seeking a Manager of Site Reliability Engineering to lead their Database Reliability Engineering team, focusing on the reliability, security, and scalability of their data infrastructure. The ideal candidate will possess extensive experience in software engineering, particularly in managing large-scale database environments, and will thrive in a collaborative and innovative culture

Job Summary

  • The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
  • Candidates must possess deep expertise in database internals, including engine tuning, replication topologies, and query optimization within Kubernetes.
  • Workday offers a flexible work approach requiring at least 50% time in-office or field per quarter to foster community connections.

Matching Summary

Match Score: 85

Workday is seeking a Manager of Site Reliability Engineering to lead their Database Reliability Engineering team, focusing on the reliability, security, and scalability of their data infrastructure. The ideal candidate will possess extensive experience in software engineering, particularly in managing large-scale database environments, and will thrive in a collaborative and innovative culture.

Salary

Not specified; Not specified; Not specified

Skills & Requirements

Must-have

  • 3+ years leading SRE or Database Engineering teams
  • 8+ years software or systems engineering experience
  • Database internals tuning and replication topologies
  • Kubernetes Operators and stateful sets management
  • AWS RDS/Aurora and GCP Cloud SQL experience
  • Prometheus, Grafana, Datadog observability stacks

Nice-to-have

  • Passion for Open-Source and Cloud Native solutions
  • Culture of psychological safety and technical excellence
  • Experience reducing Mean Time to Resolution (MTTR)
  • Deep-dive troubleshooting in Linux internals and networking
  • Hybrid Software/Database Engineer background

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • Proven ability to lead high-stakes response for critical outages
  • Strong understanding of Agile/Scrum and Continual Improvement Process

Work Rights

Not specified

Tailored Resume

Cover Letter