Manager, Site Reliability Engineering (SRE)

Epic

Dublin, Ireland
Fully remote
Leading SRE teams
AWS and GCP cloud platforms
Automation tools and observability solutions

Job Summary

  • As a Fortune 500 company and a leading AI platform, we’re shaping the future of work so teams can reach their potential and focus on what matters most.
  • The Tenant Lifecycle Engineering organisation is responsible for pushing new code to customers and monitoring the health, performance, and reliability of the Workday stack.
  • Our approach to flexible work combines in-person time and remote work, enabling teams to deepen connections and maintain a strong community.

Skills & Requirements

Must-have

  • Leading SRE teams
  • AWS and GCP cloud platforms
  • Automation tools and observability solutions
  • Incident management processes
  • Agile methodology principles
  • Continuous Delivery integration
  • People management and mentoring

Nice-to-have

  • Strong facilitation skills
  • Developing leadership skills
  • Relationship building with teams
  • Team performance improvement
  • Personal initiative in SRE
  • Troubleshooting system incidents
  • Culture of empathy and integrity

Key Requirements

  • 3+ years leading SRE teams
  • 8+ years software development engineering experience
  • Bachelor’s degree or equivalent practical experience
  • Experience with AWS, GCP, Kubernetes, Ansible, Jenkins, Git, and Argo
  • Experience managing cloud platform teams
  • Good understanding of Agile and Continual Improvement Process
  • Strong people management skills

Work Rights

Not specified
