Lead the establishment of SRE foundations for new projects building environments, monitoring, alerting, and ensuring operational readiness from day one
Job Summary
Lead the establishment of SRE foundations for new projects building environments, monitoring, alerting, and ensuring operational readiness from day one.
Collaborate with Architecture and Engineering teams to embed reliability, scalability, security, and observability into system design.
Be a technical leader and mentor supporting engineers, shaping engineering standards, and fostering a culture of learning and development.
Matching Summary
Lead the establishment of SRE foundations for new projects building environments, monitoring, alerting, and ensuring operational readiness from day one.
Skills & Requirements
Must-have
AWS services (EKS, ECS, EC2)
Kubernetes and containerised platforms
Linux systems administration
Datadog for metrics, logs, APM
SRE principles (SLOs, error budgets)
Cloud security principles
Nice-to-have
Continuous improvement approach
Technical leadership and mentorship
Calm and methodical under pressure
Pragmatic problem-solver
Clear communicator of complex concepts
Key Requirements
10+ years of hands-on technical experience
Bachelor’s Degree in Computer Science or related field
Experience with AWS
Experience with Kubernetes
Experience with Linux systems administration
Experience designing and operating observability platforms
Experience with Datadog
Experience with SRE principles
Experience working with architecture and engineering teams