Site Reliability Engineer Senior 1

RS Group

Gurugram, India
Python, go, or bash proficiency
Docker and kubernetes experience
Prometheus, grafana, azure monitor, or elk
The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms

Job Summary

  • The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms.
  • This role combines software engineering and operations to automate platform operations, improve observability, and maintain stable production environments for AI, data, and backend services.
  • We offer flexibility in your schedule, empowering you to balance life’s demands, while also maintaining your ability to serve clients.

Matching Summary

The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms.

Skills & Requirements

Must-have

  • Python, Go, or Bash proficiency
  • Docker and Kubernetes experience
  • Prometheus, Grafana, Azure Monitor, or ELK
  • Terraform, ARM, or CloudFormation
  • Networking and distributed systems understanding
  • CI/CD pipelines and deployment strategies

Nice-to-have

  • AI/ML or data platforms support
  • Chaos engineering and resiliency testing
  • High-availability, multi-region systems

Key Requirements

  • Site Reliability Engineering, DevOps, or Platform Engineering experience
  • Azure, AWS, or GCP experience
  • Bachelor’s degree
  • Cloud or Kubernetes certifications

Work Rights

Not specified

Tailored Resume

Cover Letter