The Principal Site Reliability Engineer will be a senior technical expert responsible for driving end-to-end resilience, reliability, and scalability across our mission-critical payments platform
Job Summary
The Principal Site Reliability Engineer will be a senior technical expert responsible for driving end-to-end resilience, reliability, and scalability across our mission-critical payments platform.
This role focuses on front-to-back payment flows, ensuring systems are designed for fault tolerance, observability, and operational excellence.
As a hands-on engineer, you will collaborate with development and production support teams, advocate chaos engineering, and build a culture of designing for failure.
Matching Summary
The Principal Site Reliability Engineer will be a senior technical expert responsible for driving end-to-end resilience, reliability, and scalability across our mission-critical payments platform.
Skills & Requirements
Must-have
drive end-to-end resilience
payments platform reliability
fault tolerance and observability
chaos engineering advocacy
designing for failure mindset
strong technical breadth
Nice-to-have
foster a culture of technical excellence
aligning technical decisions with business goals
incorporating new technologies
deep technical expert and thought leader
Key Requirements
12+ years in software engineering or infrastructure roles
at least 5 years focused on reliability engineering or SRE
Proven experience building and operating fault-tolerant, highly available systems at scale
Strong knowledge of distributed systems, resiliency patterns
Expertise across infrastructure, application architecture, databases, and integration patterns