Staff Site Reliability Engineer

Arcadia

Chennai, India
On-site
10-14 years sre or devops experience
Deep hands-on aws expertise including eks and vpc
Advanced terraform skills for iac management
Arcadia is an AI-powered energy intelligence platform trusted by Fortune 2000 companies to manage utility bills and advance sustainability

Job Summary

  • Arcadia is an AI-powered energy intelligence platform trusted by Fortune 2000 companies to manage utility bills and advance sustainability.
  • This Staff SRE role serves as a technical anchor in the India timezone, owning multi-week projects from problem statement to production.
  • The successful candidate will drive architectural decisions across AWS, Kubernetes, and CI/CD pipelines while mentoring engineers and collaborating with US leadership.

Matching Summary

Arcadia is an AI-powered energy intelligence platform trusted by Fortune 2000 companies to manage utility bills and advance sustainability.

Skills & Requirements

Must-have

  • 10-14 years SRE or DevOps experience
  • Deep hands-on AWS expertise including EKS and VPC
  • Advanced Terraform skills for IaC management
  • Kubernetes troubleshooting and cluster upgrades
  • CI/CD pipeline design with Jenkins and ArgoCD
  • Observability stack implementation using Prometheus
  • Proven mentorship and technical leadership ability

Nice-to-have

  • FinOps practices for cloud cost optimization
  • Secrets management platforms like HashiCorp Vault
  • Event-driven architecture with AWS Lambda
  • AI-enabled tooling and LLM-based debugging
  • Experience with MySQL and PostgreSQL reliability

Key Requirements

  • 10-14 years of SRE/DevOps/Cloud Engineering experience
  • Demonstrated project ownership and end-to-end delivery
  • Strong written and verbal communication skills
  • Ability to operate autonomously without daily direction

Work Rights

Not specified

Tailored Resume

Cover Letter