Senior Manager System Reliability Engineering

Emploisgevernovahydro

Not specified; not specified; relocation assistanc...
Aws core services expertise
Kubernetes internals mastery
Argocd and gitops workflows
You will serve as the ultimate authority on production stability and the final gatekeeper for all production deployments

Job Summary

  • You will serve as the ultimate authority on production stability and the final gatekeeper for all production deployments.
  • The role requires designing a standardized 'Golden Path' delivery platform using Backstage, ArgoCD, and GitHub Actions to eliminate bespoke methodologies.
  • Relocation assistance is provided for this strategic leadership position managing global 24/7 operational coverage.

Matching Summary

You will serve as the ultimate authority on production stability and the final gatekeeper for all production deployments.

Salary

Not specified; Not specified; Relocation Assistance Provided

Skills & Requirements

Must-have

  • AWS core services expertise
  • Kubernetes internals mastery
  • ArgoCD and GitOps workflows
  • Terraform Infrastructure as Code
  • Prometheus and Grafana observability
  • Incident Command leadership

Nice-to-have

  • NERC CIP compliance knowledge
  • Blameless culture facilitation
  • FinOps and capacity planning
  • VP-level stakeholder influence
  • Global team mentorship experience

Key Requirements

  • 14+ years overall experience
  • 8-10 years SRE or Platform Engineering leadership
  • Masters degree in STEM OR Bachelor + 10 years experience
  • Deep AWS and Kubernetes multi-region architecture skills

Work Rights

Not specified

Tailored Resume

Cover Letter