Senior Observability & Telemetry Engineer - Radian Arc

Submer

EMEA
**
Design and build observability platform
Low-latency, high-scale telemetry pipelines
Collect, process, and analyze metrics, logs, and traces
** Submer is seeking a Senior Observability & Telemetry Engineer for its Radian Arc platform, focused on building observability systems for large-scale GPU cloud infrastructure. This remote position entails designing telemetry pipelines and observability architecture to enhance system performance and reliability in edge computing environments. **

Job Summary

  • Design and build the observability platform that powers visibility, reliability, and performance insights for large-scale GPU cloud infrastructure as well as smaller edge deployments.
  • You will design and operate low-latency, high-scale telemetry pipelines that collect, process, and analyze metrics, logs, and traces from infrastructure running across core datacenter clusters and smaller edge deployments.
  • As a senior engineer, you will lead delivery of major observability initiatives, contribute to the evolution of telemetry standards and SLO implementation, and work with other teams to ensure observability is effectively integrated into the platform architecture from infrastructure to application layers.

Matching Summary

Match Score: 75

** Submer is seeking a Senior Observability & Telemetry Engineer for its Radian Arc platform, focused on building observability systems for large-scale GPU cloud infrastructure. This remote position entails designing telemetry pipelines and observability architecture to enhance system performance and reliability in edge computing environments. **

Skills & Requirements

Must-have

  • design and build observability platform
  • low-latency, high-scale telemetry pipelines
  • collect, process, and analyze metrics, logs, and traces
  • instrument GPU clusters and inference workloads
  • network observability platforms
  • telemetry collectors and exporters using Python or Go
  • design advanced alerting and anomaly detection systems

Nice-to-have

  • customer-facing observability experiences
  • contribute to SLO implementation
  • automated reliability mechanisms
  • performance analysis tools

Key Requirements

  • Senior Observability & Telemetry Engineer
  • EMEA (remote)

Work Rights

Not specified

Tailored Resume

Cover Letter