Sre Lead

Haleon

Bengaluru, India
Datadog observability platform
Sre principles and patterns
Define error budgets and reliability targets
This highly strategic role will be responsible for defining, building, and driving the Observability practice across the enterprise, which includes establishing SRE principles, patterns, and governance frameworks with Datadog as the primary observability platform

Job Summary

  • This highly strategic role will be responsible for defining, building, and driving the Observability practice across the enterprise, which includes establishing SRE principles, patterns, and governance frameworks with Datadog as the primary observability platform.
  • Own the enterprise Datadog implementation end-to-end: architecture, integrations, cost optimization, and feature enablement.
  • Define and operationalize a tiered incident severity model with clear escalation paths, SLAs, and communication protocols.

Matching Summary

This highly strategic role will be responsible for defining, building, and driving the Observability practice across the enterprise, which includes establishing SRE principles, patterns, and governance frameworks with Datadog as the primary observability platform.

Skills & Requirements

Must-have

  • Datadog observability platform
  • SRE principles and patterns
  • Define error budgets and reliability targets
  • Unified observability stack architecture
  • Incident management and AIOps

Nice-to-have

  • Collaborate across matrixed organizations
  • Agile working culture
  • Modern service management frameworks

Key Requirements

  • 8+ years SRE/DevOps/Infrastructure experience
  • 3+ years leadership capacity
  • Hands-on Datadog experience
  • Cloud platforms (AWS, Azure, GCP) proficiency
  • Automation and configuration management tools
  • CI/CD pipelines and microservices architecture
  • Scripting/programming languages

Work Rights

Not specified

Tailored Resume

Cover Letter