Manager Site Reliability Engineer

Workday

Salary: Not specified
Hybrid (at least 50% in-office or field interactions quarterly)
3+ years leading SRE or database engineering teams
8+ years of software or systems engineering experience
Database internals tuning and query optimization
Workday is seeking a Manager Site Reliability Engineer to lead its Database Reliability Engineering team, focusing on architecting and managing scalable data infrastructure. The ideal candidate should have extensive experience in software or systems engineering, particularly in leading teams that ensure the reliability and performance of large-scale database environments.

Job Summary

  • The role involves leading a team dedicated to the resiliency, security, and scalability of the data layer while moving beyond traditional DBA paradigms.
  • Candidates must have extensive experience managing databases within Kubernetes using Operators or StatefulSets and implementing robust observability stacks.
  • Workday offers a flexible work model requiring at least half of the time each quarter to be spent in-office or with customers.

Matching Summary

Match Score: 85


Salary

Not specified

Skills & Requirements

Must-have

  • 3+ years leading SRE or Database Engineering teams
  • 8+ years of software or systems engineering experience
  • Database internals tuning and query optimization
  • Managing databases in Kubernetes with Operators or StatefulSets
  • AWS RDS/Aurora and GCP Cloud SQL experience
  • Implementing automated failover mechanisms
  • Observability stack with Prometheus, Grafana, Datadog

Nice-to-have

  • Passion for Open-Source and Cloud Native solutions
  • Fostering a culture of psychological safety
  • Deep-dive troubleshooting in Linux internals
  • Experience reducing Mean Time to Resolution (MTTR)
  • Visionary approach to data infrastructure architecture

Key Requirements

  • Bachelor's degree in Computer Science or related field
  • 5+ years spearheading response for critical data outages
  • Strong understanding of Agile/Scrum and Continual Improvement Process

Work Rights

Not specified
