Bxti, Site Reliability Engineer - Data, Cloud & Developer Experience

Blackstone

London, United Kingdom
Sre methodologies adoption
Observability systems and pipelines
Monitoring and alerting implementation
Blackstone’s Site Reliability Engineering team is responsible for improving the reliability of systems and services to meet the needs of the business

Job Summary

  • Blackstone’s Site Reliability Engineering team is responsible for improving the reliability of systems and services to meet the needs of the business.
  • This position involves the selection, implementation, and maintenance of key observability tooling, requiring ongoing evaluation of the firm’s needs in observability, monitoring, alerting, resilience, and recovery.
  • We aim to eliminate manual work, improve operational efficiency, and ensure the high quality outputs in all that we do.

Matching Summary

Blackstone’s Site Reliability Engineering team is responsible for improving the reliability of systems and services to meet the needs of the business.

Skills & Requirements

Must-have

  • SRE methodologies adoption
  • observability systems and pipelines
  • monitoring and alerting implementation
  • incident response and postmortems
  • automation for system management
  • Linux, Windows, and Networking troubleshooting

Nice-to-have

  • collaboration with diverse teams
  • continuous improvement mindset
  • blameless culture fostering
  • sense of shared ownership

Key Requirements

  • Python, C#, Typescript coding ability
  • AWS experience required
  • Azure experience preferred
  • Terraform, Puppet, Gitlab CI proficiency
  • Docker and container schedulers experience
  • Grafana, Prometheus, Splunk experience

Work Rights

Not specified

Tailored Resume

Cover Letter