Senior Site Reliability Engineer - Observability

DIMENSIONAL

Multiple Locations
Hybrid

Job Summary

  • You will be responsible for the reliability, scalability, and continued evolution of the observability platforms.
  • Roughly half your time will be spent on steady-state operations and platform support, and the other half on engineering projects.
  • Dimensional offers a variety of programs to help take care of you, your family, and your career, including comprehensive benefits, educational initiatives, and special celebrations.

Skills & Requirements

Must-have

  • ELK Stack expertise
  • Grafana administration
  • Observability principles
  • On-premises infrastructure operations
  • Python for automation
  • Linux systems knowledge
  • Incident resolution

Nice-to-have

  • Prometheus experience
  • New Relic administration
  • Log shipping tools
  • Distributed tracing tools
  • Cloud observability offerings
  • Observability standards governance

Key Requirements

  • 5+ years of experience in SRE, DevOps, or Platform Engineering
  • Bachelor's degree or equivalent experience
  • Deep hands-on experience with the ELK Stack
  • Strong Grafana experience
  • Proficiency in Python
  • Strong Linux knowledge
  • Comfort with configuration management tools

Work Rights

Not specified
