Site Reliability Engineer

Barclays

Pune, India
AWS
Kubernetes (ecs)
Python, bash, json/yaml
Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems, platforms, and technology

Job Summary

  • Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems, platforms, and technology.
  • Develop tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.
  • Collaborate with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle.

Matching Summary

Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems, platforms, and technology.

Skills & Requirements

Must-have

  • AWS
  • Kubernetes (ECS)
  • Python, Bash, JSON/Yaml
  • High availability, fault-tolerant systems
  • Disaster recovery, zero downtime solutions
  • Continuous delivery

Nice-to-have

  • Azure, GCP
  • Fargate, GCE
  • ForgeRock COTS based IAM solutions
  • Risk and controls
  • Business acumen
  • Strategic thinking

Key Requirements

  • Experience with AWS
  • Experience with Kubernetes (ECS)
  • Coding in Python, Bash and JSON/Yaml
  • Experience running disaster recovery
  • Experience designing continuous delivery

Work Rights

Not specified

Tailored Resume

Cover Letter