Site Reliability Engineer (senior Or Staff), Storage Layer Services (sls)

MongoDB

Montreal, Canada
Base: $144,000 - $200,000 cad; bonus/equity: not s...
On-site
Operate distributed systems
Python, go, or similar language
Stateful storage or database systems
Partner with teams to define SLOs, shape capacity plans, and ensure reliability, durability, and operational safety of the storage layer

Job Summary

  • Partner with teams to define SLOs, shape capacity plans, and ensure reliability, durability, and operational safety of the storage layer.
  • Build for reliability, making services and infrastructure available, resilient, fault-tolerant, and self-healing.
  • Identify and configure key metrics to detect incidents and quantify service health, availability, and performance.

Matching Summary

Partner with teams to define SLOs, shape capacity plans, and ensure reliability, durability, and operational safety of the storage layer.

Salary

Base: $144,000 - $200,000 CAD; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • operate distributed systems
  • Python, Go, or similar language
  • stateful storage or database systems
  • Kubernetes containerization
  • cloud infrastructure platforms (AWS, GCP, Azure)
  • Linux OS internals and networking

Nice-to-have

  • customer-focused mindset
  • efficiency in processes and operations
  • automation over manual processes
  • leading major architectural shifts
  • managing multi-cloud environments
  • designing secure, multi-tenant environments

Key Requirements

  • 6+ years of experience in software development and operating distributed systems

Work Rights

Not specified

Tailored Resume

Cover Letter