Senior Site Reliability - Workflow Automation

Dimensional Fund Advisors

Multiple Locations
Hybrid
Apache airflow expertise
Enterprise job scheduling platforms
Python for automation
You will own the reliability, scalability, and operational excellence of workflow orchestration platforms, primarily Apache Airflow and Broadcom Automic/UC4

Job Summary

  • You will own the reliability, scalability, and operational excellence of workflow orchestration platforms, primarily Apache Airflow and Broadcom Automic/UC4.
  • Roughly half your time will be spent on steady state operations and incident response, and the other half on engineering projects that meaningfully improve the platforms you support.
  • Dimensional offers a variety of programs to help take care of you, your family, and your career, including comprehensive benefits, educational initiatives, and special celebrations of our history, culture, and growth.

Matching Summary

You will own the reliability, scalability, and operational excellence of workflow orchestration platforms, primarily Apache Airflow and Broadcom Automic/UC4.

Skills & Requirements

Must-have

  • Apache Airflow expertise
  • Enterprise job scheduling platforms
  • Python for automation
  • Kubernetes and Docker
  • Observability principles and tools
  • Incident resolution and communication

Nice-to-have

  • Automic/UC4 experience
  • Managed Airflow offerings
  • Data platform ecosystems
  • Legacy scheduler migration
  • Error budget and SLO management

Key Requirements

  • 5+ years of experience in SRE, DevOps, or platform engineering
  • Bachelor’s degree in a technical field or equivalent practical experience
  • Strong Linux and Windows systems knowledge
  • Cloud environments (AWS preferred)

Work Rights

Not specified

Tailored Resume

Cover Letter