Senior Site Reliability Engineer, Storage Layer Services (sls)

MongoDB

Montreal, Canada
Base: $144,000 - $200,000 cad; bonus/equity: not s...
On-site
Operate distributed systems at scale
Python, go, or similar language proficiency
Operate stateful storage or database systems
Partner with teams to define SLOs, shape capacity plans, and ensure the reliability, durability, and operational safety of the storage layer

Job Summary

  • Partner with teams to define SLOs, shape capacity plans, and ensure the reliability, durability, and operational safety of the storage layer.
  • Build for reliability, making services and infrastructure available, resilient, fault-tolerant, and self-healing.
  • Identify and configure key metrics to detect incidents and quantify service health, availability, and performance.

Matching Summary

Partner with teams to define SLOs, shape capacity plans, and ensure the reliability, durability, and operational safety of the storage layer.

Salary

Base: $144,000 - $200,000 CAD; Bonus/Equity: Not specified; Benefits: equity, ESPP, PTO, parental leave, fertility assistance, RRSP match, mental health, backup care, health, dental, vision

Skills & Requirements

Must-have

  • operate distributed systems at scale
  • Python, Go, or similar language proficiency
  • operate stateful storage or database systems
  • Kubernetes experience
  • cloud infrastructure platforms (AWS, GCP, Azure)
  • Linux OS internals and networking

Nice-to-have

  • customer-focused mindset
  • value efficiency in processes
  • prefer automation over manual processes
  • leading major architectural shifts
  • managing multi-cloud infrastructure

Key Requirements

  • 6+ years of experience
  • operate stateful storage or database systems at scale
  • experience using and extending containerization technologies
  • expertise in cloud infrastructure platforms
  • understanding of Linux operating system internals

Work Rights

Not specified

Tailored Resume

Cover Letter