Proficiency in golang, python, or ruby programming
Workday is seeking a Site Reliability Engineer for its Multicloud Platform, focusing on ensuring the reliability and availability of its cloud-native services. The ideal candidate will have a strong background in distributed systems, particularly with Kubernetes and cloud platforms like AWS, GCP, or Azure, and will thrive in a collaborative, fast-paced environment
Job Summary
The primary function of the team is to ensure the reliability and availability of the platform to meet desired SLAs while reducing operational load.
Engineers will develop and launch effective SLIs to ensure SLOs are achieved through building an extendable Observability architecture and runbook automation.
Workday offers a flexible work approach requiring at least half of the time each quarter to be spent in-office or with customers, partners, and prospects.
Matching Summary
Match Score: 88
Workday is seeking a Site Reliability Engineer for its Multicloud Platform, focusing on ensuring the reliability and availability of its cloud-native services. The ideal candidate will have a strong background in distributed systems, particularly with Kubernetes and cloud platforms like AWS, GCP, or Azure, and will thrive in a collaborative, fast-paced environment.
Salary
Not specified; Not specified; Not specified
Skills & Requirements
Must-have
3+ years SRE experience in distributed systems
Strong Kubernetes experience in public cloud
Proficiency in GoLang, Python, or Ruby programming
Experience with AWS, GCP, or Azure environments
Linux operating system administration skills
Nice-to-have
Passion for automation and reducing operational toil
Experience collaborating with global remote teams
Excellent documentation and runbook development skills
Background presenting at Cloud Native conferences
Key Requirements
BS in Computer Science or equivalent years of experience
1-3+ years handling distributed systems in public cloud