[s3ns] Sre Monitoring & Observability (h/f)

Thales Group

Not specified (assumed hybrid based on industry norms)
Prometheus mimir grafana loki
Kubernetes on-prem management
Sla slo sli error budget
Thales Group is seeking an SRE Monitoring & Observability Engineer to join their S3NS team, a collaboration with Google Cloud focused on providing secure cloud solutions. The ideal candidate should possess a strong background in Site Reliability Engineering, particularly with monitoring stacks and Kubernetes, and have a minimum of three years of relevant experience

Job Summary

  • The role involves maintaining and evolving the monitoring stack for S3NS infrastructure on-premises in partnership with Google Cloud.
  • Candidates are expected to ensure availability commitments through rigorous SLI, SLO, and SLA tracking while participating in on-call rotations.
  • The position requires automating operational tasks via scripts and contributing to broader platform initiatives including IaaS and KaaS.

Matching Summary

Match Score: 85

Thales Group is seeking an SRE Monitoring & Observability Engineer to join their S3NS team, a collaboration with Google Cloud focused on providing secure cloud solutions. The ideal candidate should possess a strong background in Site Reliability Engineering, particularly with monitoring stacks and Kubernetes, and have a minimum of three years of relevant experience.

Skills & Requirements

Must-have

  • Prometheus Mimir Grafana Loki
  • Kubernetes on-prem management
  • SLA SLO SLI error budget
  • Incident response and post-mortems
  • CICD pipeline automation

Nice-to-have

  • Stress management during incidents
  • Clear communication skills
  • Complex problem solving mindset

Key Requirements

  • Bachelor's degree or equivalent (Bac+5)
  • Minimum 3 years of relevant experience
  • Proficiency in SRE concepts and Kubernetes

Work Rights

Not specified

Tailored Resume

Cover Letter