Sre Observability Slo Engineer

GE VERNOVA

Fully remote
Sre and observability engineering
Kubernetes observability and metrics
Sli and slo implementation
The Observability & SLO Engineer will build and own the full telemetry stack to provide real-time reliability confidence for mission-critical energy management systems

Job Summary

  • The Observability & SLO Engineer will build and own the full telemetry stack to provide real-time reliability confidence for mission-critical energy management systems.
  • This role involves establishing initial observability coverage and then driving ongoing improvements aligned with product releases and customer onboarding.
  • The position offers relocation assistance and is fully remote, supporting a collaborative and high-impact team environment.

Matching Summary

The Observability & SLO Engineer will build and own the full telemetry stack to provide real-time reliability confidence for mission-critical energy management systems.

Skills & Requirements

Must-have

  • SRE and observability engineering
  • Kubernetes observability and metrics
  • SLI and SLO implementation
  • Telemetry standards and architecture
  • Alerting strategies with noise reduction
  • AWS CloudWatch and synthetic monitoring
  • Python or Bash scripting for automation

Nice-to-have

  • Familiarity with OpenTelemetry instrumentation
  • Experience with chaos engineering practices
  • Knowledge of AIOps and ML anomaly detection
  • Experience in regulated industries
  • Strong Linux administration skills
  • Leadership and influencing skills
  • Passionate about continuous learning

Key Requirements

  • 2–3 years in SRE or observability roles
  • Experience with major observability platforms
  • Bachelor's degree in STEM or related field
  • Proficiency in query/visualization languages
  • AWS certifications preferred
  • Strong scripting skills in Python and/or Bash

Work Rights

Not specified

Tailored Resume

Cover Letter