Sre Observability & Slo Engineer

GE Vernova

Not specified; not specified; relocation assistanc...
3-5 years sre or observability experience
Deep expertise in datadog or grafana
Hands-on kubernetes (eks/rancher) observability
This role involves building and owning the full telemetry stack to provide real-time confidence in mission-critical energy management systems

Job Summary

  • This role involves building and owning the full telemetry stack to provide real-time confidence in mission-critical energy management systems.
  • The engineer will define meaningful Service Level Indicators and Objectives while governing the review cycle to drive reliability work prioritization.
  • Relocation assistance is provided for this high-impact position within GE Vernova's GridOS Platform Engineering team.

Matching Summary

This role involves building and owning the full telemetry stack to provide real-time confidence in mission-critical energy management systems.

Salary

Not specified; Not specified; Relocation Assistance Provided

Skills & Requirements

Must-have

  • 3-5 years SRE or observability experience
  • Deep expertise in Datadog or Grafana
  • Hands-on Kubernetes (EKS/Rancher) observability
  • Experience defining SLIs and SLOs with error budgets
  • Proficiency in PromQL or Datadog Query Language
  • Scripting skills in Python or Bash

Nice-to-have

  • Familiarity with OpenTelemetry instrumentation
  • Experience with AWS CloudWatch Synthetics
  • Knowledge of chaos engineering practices
  • Exposure to AIOps or ML-driven anomaly detection
  • Experience in regulated energy or utility industries
  • AWS certifications in CloudWatch or Solutions Architect

Key Requirements

  • Bachelor's Degree in Computer Science or STEM
  • 3-5 years in SRE or infrastructure reliability roles
  • Strong Linux administration skills

Work Rights

Not specified

Tailored Resume

Cover Letter