Site Reliability Engineer 5

Adobe Media and Data Science Research (MDSR) Laboratory

Bangalore, India
Define and drive reliability and scalability strategy
Architect large-scale distributed systems
Build advanced automation frameworks
Define and drive the long-term reliability and scalability strategy for the Adobe Pass platform, aligning with product and business goals

Job Summary

  • Define and drive the long-term reliability and scalability strategy for the Adobe Pass platform, aligning with product and business goals.
  • Build and champion advanced automation frameworks that enable zero-touch operations across deployment, recovery, and scaling workflows.
  • Mentor and coach SREs and software engineers, cultivating deep reliability-first thinking across teams.

Matching Summary

Define and drive the long-term reliability and scalability strategy for the Adobe Pass platform, aligning with product and business goals.

Skills & Requirements

Must-have

  • Define and drive reliability and scalability strategy
  • Architect large-scale distributed systems
  • Build advanced automation frameworks
  • Introduce AI/ML-based predictive monitoring
  • Lead organization-wide reliability initiatives
  • Serve as technical authority during incidents
  • Lead large-scale performance tuning

Nice-to-have

  • Cultivating deep reliability-first thinking
  • Thought leader in reliability engineering
  • Experience in high-traffic systems
  • Familiarity with big data ecosystems
  • Hands-on experience with security and compliance

Key Requirements

  • 12+ years of experience in SRE/production engineering
  • Proven track record of managing highly available systems
  • Expert-level proficiency in Python, Go, Java, or Bash
  • Deep understanding of Kubernetes and microservices
  • Advanced experience with IaC and CI/CD
  • Mastery in observability and monitoring stacks
  • Strong expertise in networking and distributed databases

Work Rights

Not specified

Tailored Resume

Cover Letter