Senior Manager System Reliability Engineering

GE VERNOVA

Aws core services expertise
Kubernetes internals mastery
Argocd and gitops workflows
The Senior Manager will serve as the ultimate authority on production stability and the final gatekeeper for all deployments across the global GridOS SaaS portfolio

Job Summary

  • The Senior Manager will serve as the ultimate authority on production stability and the final gatekeeper for all deployments across the global GridOS SaaS portfolio.
  • This role requires designing standardized cloud infrastructure and a 'Golden Path' delivery platform using Backstage, ArgoCD, and GitHub Actions to eliminate bespoke methodologies.
  • The successful candidate must lead high-severity incident command, facilitate blameless root cause analysis, and manage end-to-end disaster recovery strategies.

Matching Summary

The Senior Manager will serve as the ultimate authority on production stability and the final gatekeeper for all deployments across the global GridOS SaaS portfolio.

Skills & Requirements

Must-have

  • AWS core services expertise
  • Kubernetes internals mastery
  • ArgoCD and GitOps workflows
  • Terraform Infrastructure as Code
  • Prometheus and Grafana observability
  • Incident Command leadership
  • SLO/SLI target establishment

Nice-to-have

  • Blameless culture facilitation
  • NERC CIP compliance knowledge
  • FinOps and capacity planning
  • VP-level stakeholder influence
  • Global team mentorship experience
  • SOC2 and ISO 27001 standards
  • Follow-the-Sun architecture design

Key Requirements

  • 14+ years overall experience
  • 8-10 years in SRE or Platform Engineering
  • Masters degree in STEM OR Bachelor's + 10 years
  • Deep AWS and Kubernetes expertise
  • Proven organizational influence skills

Work Rights

Not specified

Tailored Resume

Cover Letter