The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms
Job Summary
The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms.
This role combines software engineering and operations to automate platform operations, improve observability, and maintain stable production environments for AI, data, and backend services.
We offer flexibility in your schedule, empowering you to balance life’s demands, while also maintaining your ability to serve clients.
Matching Summary
The Senior Platform Site Reliability Engineer ensures the reliability, scalability, and availability of NAS AI Ecosystem platforms.
Skills & Requirements
Must-have
Python, Go, or Bash proficiency
Docker and Kubernetes experience
Prometheus, Grafana, Azure Monitor, or ELK
Terraform, ARM, or CloudFormation
Networking and distributed systems understanding
CI/CD pipelines and deployment strategies
Nice-to-have
AI/ML or data platforms support
Chaos engineering and resiliency testing
High-availability, multi-region systems
Key Requirements
Site Reliability Engineering, DevOps, or Platform Engineering experience