Job Summary
Assume end-to-end accountability for the Clearing production environment, ensuring high availability, optimal performance, and robust resilience of business-critical systems.
Act as Incident Commander during major incidents, leading resolution efforts, managing stakeholder communications, and driving root cause analysis and remediation.
Establish and maintain robust observability practices, employing metrics, logging, and tracing to drive data-driven decisions and improve system health.
Skills & Requirements
Must-have
End-to-end accountability for the production environment
Incident Commander during major incidents
Building and mentoring an SRE team
Response and resolution SLAs
Strong partnerships across LCH and LSEG
Establishing and maintaining observability practices
Deep technical expertise in Oracle Database
Implementing SRE frameworks
Leading teams supporting mixed infrastructure
Delivering automation at scale
Expertise in automation (Python, Shell, PowerShell, etc.)
Nice-to-have
Fostering a culture of accountability
Driving data-driven decisions
Proactive risk identification and mitigation
Knowledge of clearing and settlement processes
Familiarity with financial services regulatory requirements
Key Requirements
3+ years in a leadership capacity
Degree-educated, or equivalent work experience
Experience supporting systems deployed on AWS (preferred)
Knowledge of financial markets
Familiarity with financial services governance frameworks