Sre Lead

Haleon

**
Datadog observability platform
Sre principles and patterns
Define error budgets and reliability targets
** Haleon is seeking a Site Reliability Engineering (SRE) Lead to establish an SRE Center of Excellence and drive observability practices across the enterprise. The ideal candidate will have extensive experience in SRE, particularly with Datadog, and strong leadership skills to influence various teams. This role offers the opportunity to shape the future of a leading consumer health company focused on improving everyday health. **

Job Summary

  • This highly strategic role will be responsible for defining, building, and driving the Observability practice across the enterprise, which includes establishing SRE principles, patterns, and governance frameworks with Datadog as the primary observability platform.
  • Own the enterprise Datadog implementation end-to-end: architecture, integrations, cost optimization, and feature enablement.
  • Define and operationalize a tiered incident severity model with clear escalation paths, SLAs, and communication protocols.

Matching Summary

Match Score: 75

** Haleon is seeking a Site Reliability Engineering (SRE) Lead to establish an SRE Center of Excellence and drive observability practices across the enterprise. The ideal candidate will have extensive experience in SRE, particularly with Datadog, and strong leadership skills to influence various teams. This role offers the opportunity to shape the future of a leading consumer health company focused on improving everyday health. **

Skills & Requirements

Must-have

  • Datadog observability platform
  • SRE principles and patterns
  • Define error budgets and reliability targets
  • Unified observability stack architecture
  • Incident severity model definition
  • AIOps capabilities within Datadog

Nice-to-have

  • Agile working culture
  • Collaboration across matrixed organizations
  • Modern service management frameworks
  • Exposure to regulated industries

Key Requirements

  • 8+ years in SRE, DevOps, or Infrastructure roles
  • 3+ years in a leadership capacity
  • Deep hands-on Datadog experience
  • Strong cloud platform knowledge (AWS, Azure, GCP)
  • Proficiency in automation/configuration management tools
  • Solid understanding of CI/CD, distributed systems, microservices
  • Familiarity with scripting/programming languages
  • Strong background in performance tuning, capacity planning, resilience

Work Rights

Not specified

Tailored Resume

Cover Letter