Moniepoint is seeking a Team Lead for Site Reliability Engineering (SRE) to oversee a team of engineers dedicated to maintaining the reliability of their financial platform. The ideal candidate should have extensive experience in SRE or backend engineering, strong leadership skills, and proficiency in programming languages such as Java, Go, Rust, or Python, with an emphasis on cloud platforms like GCP or AWS
Job Summary
You will be designing high-level reliability architecture, while also mentoring engineers, defining the technical roadmap, and driving the culture of Site Reliability Engineering within a team.
Set the technical direction for the SRE team. Architect self-healing systems, define reliability standards (Production Readiness Reviews), and drive the adoption of observability as Code and automation best practices.
You’ll receive an attractive salary, pension, health insurance, annual bonus, plus other benefits.
Matching Summary
Match Score: 85
Moniepoint is seeking a Team Lead for Site Reliability Engineering (SRE) to oversee a team of engineers dedicated to maintaining the reliability of their financial platform. The ideal candidate should have extensive experience in SRE or backend engineering, strong leadership skills, and proficiency in programming languages such as Java, Go, Rust, or Python, with an emphasis on cloud platforms like GCP or AWS.
Skills & Requirements
Must-have
design high-level reliability architecture
define reliability standards
drive adoption of observability
define end-to-end standard for system visibility
instrument their code
govern the monitoring ecosystem
refine incident management process
define Service Level Objectives (SLOs)
Nice-to-have
culture of innovation, teamwork, and growth
engineering excellence
calm authority
Key Requirements
5 years of experience in SRE or Backend Engineering
2 years in a Lead or Senior/Staff role
Expert-level proficiency in Java, Go, Rust, or Python
Mastery of distributed systems patterns
Deep expertise with Google Cloud Platform (GCP) or AWS
Extensive experience running Kubernetes (GKE) at scale