Site Reliability Engineer

EXASOFT PTE. LTD.

Singapore, Singapore
EXASOFT PTE. LTD. is seeking a Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of its systems and services. The role requires expertise in AWS and software engineering principles, with a focus on automation and monitoring to enhance operational excellence.

Job Summary

  • The Site Reliability Engineer ensures the reliability, availability, and performance of systems by applying software engineering principles to operations.
  • Candidates must own end-to-end system reliability with clearly defined SLAs, SLOs, and SLIs while establishing error budget policies.
  • The role involves leading major incident response efforts and standardizing observability across environments using robust monitoring frameworks.

Skills & Requirements

Must-have

  • 3-5 years relevant experience
  • Advanced AWS services knowledge
  • Terraform or CloudFormation proficiency
  • Multi-AZ, multi-region architecture design
  • Dynatrace, CloudWatch, and OpenTelemetry tools

Nice-to-have

  • Blameless postmortem culture
  • Error budget policy governance
  • Modular IaC design patterns
  • Well-Architected Framework pillars
  • Horizontal scalability expertise

Key Requirements

  • 3-5 years relevant experience
  • AWS EC2, ECS, EKS, and Lambda experience
  • Strong understanding of AWS networking
  • Hands-on Terraform or CDK experience

Work Rights

Not specified
