This role is pivotal to defining and delivering the observability strategy to make systems measurable, reliable, and continually improving
Job Summary
This role is pivotal to defining and delivering the observability strategy to make systems measurable, reliable, and continually improving.
You will design and operate scalable telemetry pipelines for metrics, logs, traces, and events across distributed systems using tools like OpenTelemetry and Kafka.
The position offers a flexible hybrid working model within a culture that values innovation, ownership, and continuous learning at an enterprise scale.
Matching Summary
This role is pivotal to defining and delivering the observability strategy to make systems measurable, reliable, and continually improving.
Skills & Requirements
Must-have
OpenTelemetry Kafka Cribl ClickHouse
Prometheus Grafana Splunk Elastic Loki
Python Go Java programming proficiency
Terraform Ansible infrastructure as code
Kubernetes microservices service meshes
SLI SLO definition and alerting strategies
API integrations secure token flows
Nice-to-have
Regulated financial services background
Open source observability contributions
Flexible hybrid working model
Culture of innovation and ownership
Continuous learning environment
Key Requirements
Success in observability SRE or platform engineering