Senior Site Reliability Engineer

Austin, United States
On-site

Job Summary

  • This role will lead the Observability team that supports our game platform and studios across production and development environments.
  • Architect, develop, and evolve our enterprise-wide observability platform to provide deep visibility into infrastructure and application performance.
  • Partner with developers and operations teams to enable self-service observability capabilities.

Skills & Requirements

Must-have

  • enterprise-wide observability platform
  • monitoring solutions
  • observability standards and best practices
  • automation for monitoring configurations
  • unify metrics, logs, and traces
  • cost optimization initiatives
  • self-service observability capabilities
  • automation and alerting processes
  • architectural reviews
  • documentation and knowledge sharing
  • reports and visualizations
  • emerging technologies
  • system performance, resiliency, and insight quality
  • industry-leading monitoring and telemetry tools

Nice-to-have

  • building an Observability practice
  • developing software for highly scalable/distributed systems
  • Infrastructure as Code (IaC) for highly elastic workloads
  • gaming or similar industries
  • common source code repositories

Key Requirements

  • 5+ years of professional experience in IT
  • 3+ years specializing in observability, monitoring, or SRE
  • Deep knowledge of monitoring toolsets
  • Proficiency in Python for automation
  • Hands-on experience with Kubernetes, Docker, and cloud platforms
  • Strong understanding of networking, infrastructure, and performance optimization
  • Experience with IaC tools such as Terraform
  • Familiarity with configuration management tools
  • Proven track record designing and delivering dashboards, alerts, and performance reports

Work Rights

Not specified
