Site Reliability Engineering is a production-oriented discipline focused on improving system service availability, observability, scalability, performance, and reliability for technology products by applying sound software engineering principles and adopting the latest technology and tooling
Job Summary
Site Reliability Engineering is a production-oriented discipline focused on improving system service availability, observability, scalability, performance, and reliability for technology products by applying sound software engineering principles and adopting the latest technology and tooling.
Responsibilities include diagnosing and resolving issues across the entire stack (hardware, software, application, network), identifying and driving automation opportunities, and proactively addressing system reliability risks.
The company values courageous teammates, needle-movers, and learning champions, striving to support the health and well-being of all employees and celebrating diversity and inclusion.
Matching Summary
Site Reliability Engineering is a production-oriented discipline focused on improving system service availability, observability, scalability, performance, and reliability for technology products by applying sound software engineering principles and adopting the latest technology and tooling.