Manager, Site Reliability Engineering

DIMENSIONAL

Austin, Texas, USA
Fully remote
Elk, prometheus, grafana expertise
Python-based service development
Linux administration and ci/cd
Lead a global team of SREs, driving professional growth and operational excellence through coaching and mentorship

Job Summary

  • Lead a global team of SREs, driving professional growth and operational excellence through coaching and mentorship.
  • Own the monitoring strategy, service health, and performance indicators through dashboarding and alerting.
  • Relentlessly pursue opportunities to eradicate toil through automation and build confidence in deployments through enhanced data quality assurance processes.

Matching Summary

Lead a global team of SREs, driving professional growth and operational excellence through coaching and mentorship.

Skills & Requirements

Must-have

  • ELK, Prometheus, Grafana expertise
  • Python-based service development
  • Linux administration and CI/CD
  • Airflow, dbt, Snowflake experience
  • Automated testing capability
  • Cross-functional collaboration skills

Nice-to-have

  • Open-minded, curious, resourceful
  • Lead with vision and purpose
  • Passionate about modern technologies
  • Solve problems systematically and transparently
  • Share ideas and integrate feedback

Key Requirements

  • Managerial experience leading SRE teams
  • Experience running software projects end-to-end
  • Demonstrated self-organization and strong communication

Work Rights

Not specified

Tailored Resume

Cover Letter