Job Summary
This role focuses on designing and implementing observability strategies that provide deep insight into system health, performance, and user experience.
You will work closely with platform engineers, SREs, and application teams to instrument services, implement telemetry standards, and build actionable monitoring that enables proactive incident prevention and faster troubleshooting.
This role blends SRE fundamentals, telemetry engineering, and application-level instrumentation.
Skills & Requirements
Must-have
Observability strategy and platform ownership
Metrics, monitoring, and alerting
Logging and log intelligence
Distributed tracing and telemetry instrumentation
Application-level observability
CI/CD and observability integration
Performance and reliability insights
Nice-to-have
Distinguish signal from noise
Think in telemetry and system behavior
Collaborate with developers
Design proactive monitoring
Key Requirements
2–5 years of experience in Observability, SRE, DevOps, or Platform Engineering
Bachelor’s degree in Computer Science or a related field