SRE Lead

Haleon

Haleon is seeking an experienced Site Reliability Engineering (SRE) Lead to establish an SRE Center of Excellence (CoE) and drive the observability practice across the organization using Datadog. The ideal candidate will have extensive experience in SRE or DevOps roles, strong technical skills, and the ability to influence cross-functional teams. The position offers a unique opportunity to shape Haleon's observability strategy and contribute to the company's mission of improving everyday health.

Job Summary

  • We are seeking an experienced SRE professional to set up the SRE CoE at Haleon.
  • This highly strategic role is responsible for defining, building, and driving the Observability practice across the enterprise, including establishing SRE principles, patterns, and governance frameworks with Datadog as the primary observability platform.
  • This role demands deep technical expertise, executive presence, and the ability to influence without authority across diverse engineering, infrastructure, and product teams.

Skills & Requirements

Must-have

  • Define and own Observability CoE charter
  • Establish governance frameworks for observability
  • Own enterprise Datadog implementation end-to-end
  • Architect unified observability stack
  • Define tiered incident severity model
  • Lead major incident command for P0/P1 events

Nice-to-have

  • Deep human understanding and trusted science
  • Agile, performance-focused culture
  • Co-creating an environment
  • Collaboration across matrixed organizations
  • Exposure to regulated industries

Key Requirements

  • 8+ years in SRE, DevOps, or Infrastructure roles
  • 3+ years in a leadership capacity
  • Deep hands-on experience with Datadog
  • Strong knowledge of cloud platforms (AWS, Azure, or GCP)
  • Proficiency in automation and configuration management tools
  • Solid understanding of CI/CD pipelines, distributed systems, and microservices architecture
  • Familiarity with scripting/programming languages (Python, Go, Shell, etc.)
  • Strong background in performance tuning, capacity planning, and resilience engineering
  • Experience with agile/scrum techniques

Work Rights

Not specified
