Site Reliability Engineer (sre) - Identity Access Management Iam
Barclays
Pune, India
Aws, kubernetes, ecs, python, bash, json/yaml
High availability, fault tolerance, auto-scaling
Disaster recovery, zero downtime solutions
Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems
Job Summary
Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems.
Develop tools and scripts to automate operational processes, reducing manual workload and improving system resilience.
Collaborate with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle.
Matching Summary
Apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems.
Skills & Requirements
Must-have
AWS, Kubernetes, ECS, Python, Bash, JSON/Yaml
High availability, fault tolerance, auto-scaling
Disaster recovery, zero downtime solutions
Continuous delivery for microservices
SRE principles into DevSecOps lifecycle
Nice-to-have
ForgeRock COTS IAM solutions
PKI based self-sovereign identity
Azure, GCP, Fargate, GCE
Collaborative team culture
Industry technology trends
Key Requirements
Experience in designing, implementing, deploying, and running highly available systems
Strong expertise in AWS
Strong experience in running disaster recovery
Hands-on experience coding in Python, Bash and JSON/Yaml