Manager Site Reliability Engineer

Workday

Hybrid
3+ years leading sre or database engineering teams
8+ years software or systems engineering experience
Experience managing databases within kubernetes
The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms

Job Summary

  • The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.
  • Candidates must have extensive experience designing resilient data infrastructure and implementing automated failover mechanisms at scale.
  • Workday offers a flexible work approach requiring at least half of the time to be spent in-office or with customers each quarter.

Matching Summary

The role involves leading a visionary team to replace manual database interventions with automated, self-healing platforms.

Skills & Requirements

Must-have

  • 3+ years leading SRE or Database Engineering teams
  • 8+ years software or systems engineering experience
  • Experience managing databases within Kubernetes
  • Expertise in database internals and query optimization
  • Proven ability to reduce Mean Time to Resolution
  • Experience with AWS RDS/Aurora and GCP Cloud SQL

Nice-to-have

  • Passion for Open-Source and Cloud Native solutions
  • Fostering a culture of psychological safety
  • Deep-dive troubleshooting skills for Linux internals
  • Strong understanding of Agile/Scrum methodologies
  • Curious minds and courageous collaborators

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • 4+ years as an SRE/DBRE
  • 5+ years spearheading response for critical data outages
  • Experience with Prometheus, Grafana, Datadog, or PMM

Work Rights

Not specified

Tailored Resume

Cover Letter