G13 - Operations Support Engineer

FPT ASIA PACIFIC PTE. LTD.

Singapore, Singapore
FPT ASIA PACIFIC PTE. LTD. is seeking an Operations Support Engineer in Singapore to design and manage service observability, optimize performance, and enforce security controls for cloud services. The ideal candidate will have over four years of experience in Site Reliability Engineering or related fields, with a strong background in AWS, Kubernetes, and observability tools.

Job Summary

  • The role involves designing and owning the service observability usage model to ensure all metrics, logs, and traces flow into Elastic Cloud.
  • Candidates will build proactive alerting systems and drive post-incident root cause analysis to meet closure SLAs.
  • The position requires implementing secure supply chain controls and integrating telemetry for model services into unified views.

Salary

Not specified

Skills & Requirements

Must-have

  • 4+ years SRE or Production Ops experience
  • AWS and Kubernetes troubleshooting knowledge
  • Elastic Cloud observability implementation
  • Python, Bash, or Go scripting skills
  • Incident management and RCA authorship

Nice-to-have

  • Experience with OPA or Kyverno policy-as-code
  • Mentoring engineers on operational excellence
  • Familiarity with Terraform and Argo GitOps
  • Understanding of GPU/CPU utilization metrics
  • Proven track record in burnout prevention

Key Requirements

  • 4+ years in SRE, Production Ops, or Platform roles
  • Working knowledge of AWS and Kubernetes
  • Proficiency in Python, Bash, or Go scripting
  • Proven experience in on-call incident management

Work Rights

Not specified
