Principal Site Reliability Engineer, Infrastructure Observability

T. Rowe Price UK

Owings Mills, MD, United States
Hybrid
Infrastructure observability
Cloud & on-prem solutions
DevOps practices

Job Summary

  • As a Principal Site Reliability Engineer, Infrastructure Observability, you will help establish, develop, and lead a team of Site Reliability Engineers (SREs) focused on the observability, sustainability, scalability, measurability, and recoverability of T. Rowe Price’s cloud and on-prem solutions, using automation and best-of-breed tools.
  • The successful candidate will have a strong operations and engineering background, be hands-on when needed, and have expertise in cloud environments (public and private), infrastructure operations, DevOps practices, CI/CD toolchains and systems, code build and deployment, incident response, and 24x7 monitoring and support.
  • You’ll enjoy resources to support your career path, as well as compensation, benefits, and flexibility to enrich your life.

Salary

Base: $159,000.00 - $339,000.00; Bonus/Equity: Annual bonus eligibility; Benefits: Competitive compensation, generous retirement plan, health and wellness benefits, paid time off, family care resources

Skills & Requirements

Must-have

  • Infrastructure Observability
  • Cloud & On-Prem Solutions
  • DevOps practices
  • CI/CD toolchain
  • Incident response
  • 24x7 monitoring and support

Nice-to-have

  • Collaborative culture
  • Spirit of generosity
  • Strategic growth
  • Blameless post-mortems

Key Requirements

  • 10+ years of experience designing and operating cloud infrastructure
  • 5+ years of experience building and supporting solutions in Amazon Web Services (AWS)
  • 5+ years of experience building and running a DevOps and/or SRE function
  • Experience with implementation and operation of the chaos model at scale
  • Fluent in multiple programming languages
  • Proficiency with database development
  • Proficiency with defining, right-sizing, tracking, and reporting on SLOs/SLIs
  • Experience with implementing and managing Error Budgets
  • Knowledge/experience driving dashboard standardization
  • Knowledge/experience with observability tools
  • Knowledge/experience with cloud management tools

Work Rights

Not specified
