Site Reliability Engineer Senior 1

RSM UK

Python go or bash proficiency
Azure aws or gcp cloud experience
Docker and kubernetes hands-on skills
The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms

Job Summary

  • The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms.
  • You will implement reliability engineering practices, define SLIs/SLOs/SLAs, and automate operational processes to reduce manual effort.
  • RSM offers a competitive benefits package with schedule flexibility to help you balance life's demands while serving clients.

Matching Summary

The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms.

Skills & Requirements

Must-have

  • Python Go or Bash proficiency
  • Azure AWS or GCP cloud experience
  • Docker and Kubernetes hands-on skills
  • Prometheus Grafana monitoring tools
  • Terraform infrastructure as code
  • Networking and distributed systems knowledge

Nice-to-have

  • AI ML platform support experience
  • Chaos engineering and resiliency testing
  • Cloud or Kubernetes certifications
  • High-availability multi-region systems
  • Inclusive culture and talent experience

Key Requirements

  • Minimum SRE DevOps or Platform Engineering experience
  • Bachelor's degree required
  • Experience with CI/CD pipelines and deployment strategies

Work Rights

Not specified

Tailored Resume

Cover Letter