Site Reliability Engineer (sre)

Acquire Asia Pacific Pty

Taguig City, Philippines
Prometheus, grafana, pagerduty monitoring
Infrastructure as code with pulumi
Aws eks, msk, singlestore, mongodb
The Site Reliability Engineer serves as the guardian of our production systems, ensuring the reliability, scalability, and performance of our IoT telemetry platform

Job Summary

  • The Site Reliability Engineer serves as the guardian of our production systems, ensuring the reliability, scalability, and performance of our IoT telemetry platform.
  • Responsibilities include defining and enforcing SLOs, automating operational processes, and building infrastructure and tooling to enable engineering teams to deploy with confidence.
  • The role involves participating in a follow-the-sun on-call rotation providing 24x7 support coverage across multiple time zones.

Matching Summary

The Site Reliability Engineer serves as the guardian of our production systems, ensuring the reliability, scalability, and performance of our IoT telemetry platform.

Skills & Requirements

Must-have

  • Prometheus, Grafana, PagerDuty monitoring
  • Infrastructure as Code with Pulumi
  • AWS EKS, MSK, SingleStore, MongoDB
  • Incident response and post-mortem leadership
  • Security patch pipelines and vulnerability remediation
  • SOC2 and ISO 27001 compliance support

Nice-to-have

  • Teamwork and innovation culture
  • Data-driven decision making
  • Eliminate operational toil
  • Continuous improvement of on-call experience

Key Requirements

  • Must have experience with incident command
  • Must have experience with IaC solutions
  • Must have experience with AWS services
  • Must participate in on-call rotation

Work Rights

Not specified

Tailored Resume

Cover Letter