This is a hands-on principal engineering role focused on improving Ascend's stability, performance, and operational maturity
Job Summary
This is a hands-on principal engineering role focused on improving Ascend's stability, performance, and operational maturity.
The role will lead our approach to telemetry, observability, and proactive reliability engineering, helping us detect and resolve systemic issues before they impact customers.
The person in this role will work across services, databases, infrastructure, and engineering teams to improve how we measure system behaviour, diagnose performance problems, and prioritise reliability work.
Matching Summary
This is a hands-on principal engineering role focused on improving Ascend's stability, performance, and operational maturity.
Skills & Requirements
Must-have
system stability and performance
telemetry and observability strategy
performance analysis and diagnostics
proactive reliability engineering
distributed software environments
Nice-to-have
improving customer experience
reducing operational risk
building engineering discipline
influencing architecture decisions
mentoring engineers and leaders
Key Requirements
Principal or Staff Engineer level experience
Deep expertise in observability, telemetry, monitoring, and performance engineering
Strong hands-on capability across services, databases, APIs, infrastructure, and production diagnostics
Experience improving reliability and performance in growing SaaS or enterprise platforms
Strong incident analysis and root cause investigation skills