SHORE Solutions Inc is seeking a Site Reliability Engineer (SRE) in Taguig City, Philippines, to oversee the reliability, scalability, and performance of their IoT telemetry platform. The role focuses on defining Service Level Objectives (SLOs), automating operational processes, and enhancing incident response and security compliance
Job Summary
The Site Reliability Engineer serves as the guardian of our production systems, ensuring the reliability, scalability, and performance of our IoT telemetry platform.
Responsibilities include defining and enforcing SLOs, automating operational processes, and building infrastructure and tooling for engineering teams.
Participate in a follow-the-sun on-call rotation providing 24x7 support across multiple time zones.
Matching Summary
Match Score: 85
SHORE Solutions Inc is seeking a Site Reliability Engineer (SRE) in Taguig City, Philippines, to oversee the reliability, scalability, and performance of their IoT telemetry platform. The role focuses on defining Service Level Objectives (SLOs), automating operational processes, and enhancing incident response and security compliance.
Skills & Requirements
Must-have
SLO and error budget management
Prometheus, Grafana, PagerDuty monitoring
Infrastructure as Code with Pulumi
AWS EKS, MSK, SingleStore, MongoDB S3
Incident command and post-mortems
IAM policy enforcement
Security patch and vulnerability remediation
Nice-to-have
Automate operational processes to eliminate toil
Support SOC2 and ISO 27001 compliance
Follow-the-sun on-call rotation
Key Requirements
Experience with Infrastructure as Code (IaC)
Proficiency in AWS services
Experience with monitoring and alerting tools
Experience with incident response procedures
Experience with security and compliance initiatives