Senior Site Reliability Engineer - Observability

Dimensional Fund Advisors

Multiple Locations
Hybrid
Elk stack operations
Grafana dashboard design
Observability principles
You’ll be responsible for the reliability, scalability, and continued evolution of the tools that give our engineering organization visibility into everything they build and run

Job Summary

  • You’ll be responsible for the reliability, scalability, and continued evolution of the tools that give our engineering organization visibility into everything they build and run.
  • Roughly half your time will be spent on steady-state operations and platform support, and the other half on engineering projects that meaningfully advance the platforms you support.
  • Dimensional offers a variety of programs to help take care of you, your family, and your career, including comprehensive benefits, educational initiatives, and special celebrations of our history, culture, and growth.

Matching Summary

You’ll be responsible for the reliability, scalability, and continued evolution of the tools that give our engineering organization visibility into everything they build and run.

Skills & Requirements

Must-have

  • ELK Stack operations
  • Grafana dashboard design
  • observability principles
  • on-premises infrastructure operation
  • Python for automation
  • Linux systems knowledge

Nice-to-have

  • Prometheus experience
  • New Relic administration
  • distributed tracing tools
  • cloud-based observability offerings
  • governing observability standards

Key Requirements

  • 5+ years of experience in SRE, DevOps, or platform engineering
  • Bachelor’s degree in a technical field or equivalent practical experience
  • Proficiency in Python
  • Comfort working with configuration management tools

Work Rights

Not specified

Tailored Resume

Cover Letter