Site Reliability Engineer - Flutter Functions, Hybrid & Remote
Betfair Romania
Cluj, Romania
Hybrid
Sre best practices
Aws cloud tenancy
Monitoring and observability tools
Ensure the reliability, availability, and performance of critical gaming and betting platforms across global operations, maintaining 24/7/365 service availability for millions of customers worldwide
Job Summary
Ensure the reliability, availability, and performance of critical gaming and betting platforms across global operations, maintaining 24/7/365 service availability for millions of customers worldwide.
Implement and maintain enterprise-grade observability, disaster recovery, and business continuity capabilities across the AWS Cloud tenancy, including ownership of tooling infrastructure.
Benefits include hybrid & remote working options, self-development budget, company share scheme, generous annual leave, and opportunities for international work and customized well-being programs.
Matching Summary
Ensure the reliability, availability, and performance of critical gaming and betting platforms across global operations, maintaining 24/7/365 service availability for millions of customers worldwide.
Skills & Requirements
Must-have
SRE best practices
AWS Cloud tenancy
monitoring and observability tools
Python, Go, Bash, TypeScript, or Terraform
CI/CD pipelines and tools
containerization technologies Docker and Kubernetes
Nice-to-have
passion for system reliability
proactive approach to identifying issues
collaboration with development teams
strategic thinking
strategic communication
Key Requirements
Extensive experience with monitoring and observability tools
Demonstrated ability to work with cloud platforms (AWS, Azure, GCP)