Job Summary
The team is dedicated to improving platform reliability and observability and to delivering operational success at scale using cloud-native technologies.
Engineers will design and implement automation solutions to reduce manual effort and enable the team to operate large-scale distributed systems.
The role involves collaborating with global counterparts across New Zealand, the US, and Ireland to ensure continuous platform coverage through a follow-the-sun model.
Skills & Requirements
Must-have
1 to 8 years of SRE or DevOps experience
Hands-on Kubernetes management and troubleshooting
Proficiency in Go, Python, or Ruby programming
Experience with AWS, GCP, or Azure public clouds
Solid Linux/Unix operating system background
Nice-to-have
Familiarity with Istio, OPA, Prometheus, and Grafana
Experience contributing to cloud-native conferences
Knowledge of SLO-gated multi-stage deployment automation
Participation in follow-the-sun on-call models
Strong documentation and runbook creation skills
Key Requirements
BS in Computer Science or equivalent practical experience
1 to 8 years of site reliability engineering experience
Deep understanding of CNCF technologies in cloud environments