Staff Software Engineer - Grafana Cloud K6 | Canada | Remote

Grafana Labs

Canada
Base: cad 186,368 - cad 223,642; bonus/equity: equ...
Remote
Devops/sre practices
Operating production systems at scale
Large-scale distributed systems
Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability

Job Summary

  • Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
  • Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
  • As the reliability foundation matures, grow into broader application and product development leadership, contributing architectural and technical depth beyond operations.

Matching Summary

Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.

Salary

Base: CAD 186,368 - CAD 223,642; Bonus/Equity: equity, bonus (if applicable); Benefits: other benefits listed here

Skills & Requirements

Must-have

  • DevOps/SRE practices
  • operating production systems at scale
  • large-scale distributed systems
  • reliability engineering concepts
  • test automation
  • clear technical communication
  • modern software engineering processes

Nice-to-have

  • containerized and cloud-native systems
  • observability tooling and platforms
  • event-driven or asynchronous systems
  • SLIs/SLOs and error budgets
  • building testing frameworks
  • developer tooling

Key Requirements

  • Strong programming background
  • Experience designing, building, and operating large-scale distributed systems
  • Experience with test automation
  • Ability to influence engineering practices
  • Strong interpersonal skills
  • Self-driven and comfortable with autonomy

Work Rights

Not specified

Tailored Resume

Cover Letter