**
Backblaze is seeking a Senior Site Reliability Engineer (SRE) to enhance the stability and scalability of its cloud storage services. The ideal candidate will have extensive experience in site reliability and systems engineering, with a focus on automation and collaboration across engineering and operations teams.
**
Job Summary
Own and drive the availability, durability, and performance of critical services across all production environments.
Design and architect scalable automation solutions to eliminate toil and improve the efficiency of operational tasks across the entire platform.
Act as a principal partner to engineering, product, and operations teams, consulting on resilient system design, architecture, and operation.
Matching Summary
Match Score: 75
**
Backblaze is seeking a Senior Site Reliability Engineer (SRE) to enhance the stability and scalability of its cloud storage services. The ideal candidate will have extensive experience in site reliability and systems engineering, with a focus on automation and collaboration across engineering and operations teams.
**
Skills & Requirements
Must-have
availability, durability, performance
incident response and post-mortems
SLIs, SLOs, error budgets
automation solutions to eliminate toil
monitoring, logging, alerting frameworks
CI/CD pipelines, IaC
production-grade code (Bash, Python, Go)
Nice-to-have
resilient system design consulting
production readiness review process
capacity planning and DR strategy
reliability-first engineering culture
Key Requirements
Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience)
8+ years of progressive experience in site reliability, systems engineering, or operations
Extensive experience designing, scaling, and operating large-scale, production-grade distributed systems