Principal Site Reliability Engineer - Observability

Elastic

Spain, Spain
On-site
Operating large-scale production services
Elastic platform capabilities
Coding agents
Elastic is seeking a Principal Site Reliability Engineer specializing in Observability to enhance their Infrastructure Observability solutions. The ideal candidate should have a strong SRE background with experience in large-scale production environments and proficiency in Kubernetes and observability tools

Job Summary

  • Collaborate with product management, product design, customers and multiple teams across Elastic (especially our own SRE teams) in defining and evolving the end-to-end InfraObs experiences that enable both human and agentic users.
  • Deliver and continually evolve the experiences leveraging the Elastic Platform capabilities and coding agents.
  • We are looking for engineers with a SRE background and experience operating large-scale production services with the help of Observability tools.

Matching Summary

Match Score: 85

Elastic is seeking a Principal Site Reliability Engineer specializing in Observability to enhance their Infrastructure Observability solutions. The ideal candidate should have a strong SRE background with experience in large-scale production environments and proficiency in Kubernetes and observability tools.

Skills & Requirements

Must-have

  • operating large-scale production services
  • Elastic Platform capabilities
  • coding agents
  • AI coding agents

Nice-to-have

  • foster a culture of mutual respect
  • sincerely empathizes with others

Key Requirements

  • Proficiency operating production infrastructure in K8s
  • Proficiency using Observability tools
  • Working with a high level of autonomy
  • Excellent verbal and written communication skills

Work Rights

Not specified

Tailored Resume

Cover Letter