Site Reliability Engineer - Multicloud Platform

Workday

Fully remote
3+ years SRE experience
Kubernetes expertise required
Public cloud experience (AWS, GCP, Azure)
Workday is seeking a Site Reliability Engineer for their multicloud platform, focusing on ensuring the reliability and availability of their services. The ideal candidate will have a strong background in distributed systems, cloud environments, and automation, with a passion for improving operational efficiency.

Job Summary

  • The primary function of the SRE team is to ensure the reliability and availability of the platform to meet desired SLAs while reducing operational load.
  • Engineers will own the reliability for the complete stack and tools that deliver Workday products across public clouds using Cloud Native technologies.
  • The role requires developing effective SLIs, building an extendable Observability architecture, and establishing new processes to improve customer happiness.

Matching Summary

Match Score: 85

Skills & Requirements

Must-have

  • 3+ years SRE experience
  • Kubernetes expertise required
  • Public cloud experience (AWS, GCP, Azure)
  • Go, Python, or Ruby proficiency
  • Linux operating system knowledge
  • CI/CD and code management skills

Nice-to-have

  • Passion for automation culture
  • Experience with distributed systems
  • Strong documentation and runbook skills
  • Collaboration with global remote teams
  • Fast-paced environment adaptability

Key Requirements

  • BS in Computer Science or equivalent experience
  • 3+ years handling distributed systems in public cloud
  • 1+ years SRE experience in distributed systems (Senior Associate level)

Work Rights

Not specified
