Head Of Site Reliability Engineering

SHORE Solutions Inc

Not specified; not specified; not specified
Hybrid
Aws production services ownership
Pulumi typescript infrastructure-as-code
Slos and error budget enforcement
The role involves owning the reliability of production services running on AWS while steering the roadmap for platform resilience

Job Summary

  • The role involves owning the reliability of production services running on AWS while steering the roadmap for platform resilience.
  • You will lead and grow a remote team of SREs by coaching, hiring, performance-managing, and fostering a blameless culture.
  • This position offers the opportunity to build reliability engineering from the ground up in a mission-critical IoT platform.

Matching Summary

The role involves owning the reliability of production services running on AWS while steering the roadmap for platform resilience.

Salary

Not specified; Not specified; Not specified

Skills & Requirements

Must-have

  • AWS production services ownership
  • Pulumi TypeScript Infrastructure-as-Code
  • SLOs and error budget enforcement
  • Remote team leadership and hiring
  • Incident command and post-mortem leadership

Nice-to-have

  • Blameless culture fostering
  • IoT platform experience
  • DevSecOps practices championing
  • SOC 2 & ISO 27001 compliance knowledge
  • Hands-on monitoring participation

Key Requirements

  • Experience leading SRE teams
  • Proficiency with Pulumi and TypeScript
  • Deep knowledge of AWS services (EKS, MSK, etc.)

Work Rights

Not specified

Tailored Resume

Cover Letter