Infrastructure as code (terraform, ansible, puppet)
Object-oriented programming skills
Lead the design, implementation, and ongoing improvement of reliable, scalable, performant, and secure production platforms and services
Job Summary
Lead the design, implementation, and ongoing improvement of reliable, scalable, performant, and secure production platforms and services.
Provide technical leadership and mentorship to engineers across the organisation, promoting strong engineering standards and operational best practice.
Contribute across the full lifecycle of platform and service delivery, from design and build through to operation and optimisation.
Matching Summary
Lead the design, implementation, and ongoing improvement of reliable, scalable, performant, and secure production platforms and services.
Skills & Requirements
Must-have
Kubernetes at scale
Infrastructure as code (Terraform, Ansible, Puppet)
Object-oriented programming skills
Cloud platform experience (AWS, GCP, OCI)
Monitoring and alerting tools (Prometheus, Grafana)
Linux/Windows systems administration
CI/CD and secure SDLC practices
Nice-to-have
Kafka experience
Database management (MySQL, PostgreSQL, Redis)
Networking fundamentals
Distributed systems understanding
SRE concepts (SLIs, SLOs, toil reduction)
Elasticsearch production management
Key Requirements
5+ years of experience in DevOps, SRE, platform engineering, or software engineering
Strong Kubernetes experience
Hands-on experience with infrastructure as code tools
Strong programming skills in at least one object-oriented language
Significant hands-on experience in at least one major cloud platform
Strong monitoring, alerting, and observability experience
Solid understanding of networking fundamentals and distributed systems
Strong Linux and/or Windows systems administration experience
Experience with software delivery automation, CI/CD pipelines