Senior Site Reliability Engineer

RBC

Montréal, Canada
Site reliability engineering (sre) best practices
Monitoring, alerting, and incident management
Cloud, linux/unix/windows and services/apis, databases
This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization

Job Summary

  • This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization.
  • Perform production support role and partner with the SRE Delivery team in incident management and problem management.
  • We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper.

Matching Summary

This role will be responsible for the development, implementation, and support of Site Reliability Engineering (SRE) solutions for applications supported by the Digital Branch SRE organization.

Skills & Requirements

Must-have

  • Site Reliability Engineering (SRE) best practices
  • monitoring, alerting, and incident management
  • Cloud, Linux/Unix/Windows and services/APIs, databases
  • scripting in Java/.NET and SQL
  • major incident handling and communication
  • SRE languages and tools (Ansible, Dynatrace, ServiceNow, GitHub, Slack, ELK stack)

Nice-to-have

  • progressive thinking and continuous improvement
  • collaboration with IT partners
  • knowledge of KAFKA, OCP, SCON infrastructure
  • cloud platform applications and processes

Key Requirements

  • 5+ years of working experience in Site Reliability Engineering (SRE)
  • Intermediate experience in a variety of environments
  • Working experience with scripting
  • Strong expertise in major incident handling
  • Ability to work in a 7x24x365 work environment

Work Rights

Not specified

Tailored Resume

Cover Letter