Principal SRE

Barclays

Drive end-to-end resilience
Payments platform reliability
Fault tolerance and observability

Job Summary

  • The Principal Site Reliability Engineer will be a senior technical expert responsible for driving end-to-end resilience, reliability, and scalability across our mission-critical payments platform.
  • This role focuses on front-to-back payment flows, ensuring systems are designed for fault tolerance, observability, and operational excellence.
  • As a hands-on engineer, you will collaborate with development and production support teams, advocate chaos engineering, and build a culture of designing for failure.

Skills & Requirements

Must-have

  • Driving end-to-end resilience
  • Payments platform reliability
  • Fault tolerance and observability
  • Chaos engineering advocacy
  • Designing-for-failure mindset
  • Strong technical breadth

Nice-to-have

  • Fostering a culture of technical excellence
  • Aligning technical decisions with business goals
  • Incorporating new technologies
  • Deep technical expertise and thought leadership

Key Requirements

  • 12+ years in software engineering or infrastructure roles, with at least 5 years focused on reliability engineering or SRE
  • Proven experience building and operating fault-tolerant, highly available systems at scale
  • Strong knowledge of distributed systems and resiliency patterns
  • Expertise across infrastructure, application architecture, databases, and integration patterns
  • Ability to troubleshoot complex technical issues

Work Rights

Not specified
