Site Reliability Engineer

GLOBAL PAYMENT HOLDING COMPANY

Prague, Czech Republic
Apigee api implementations
Production incident management
Monitoring and alerting
Serve as the first line of defense for production incidents, ensuring rapid triage, root cause analysis, and resolution

Job Summary

  • Serve as the first line of defense for production incidents, ensuring rapid triage, root cause analysis, and resolution.
  • Collaborate with platform engineers to implement observability, logging, and alerting best practices for API services.
  • Join a dynamic team passionate about learning, applying cutting-edge and cost effective technologies, and innovating to deliver high-quality, and highly available API solutions.

Matching Summary

Serve as the first line of defense for production incidents, ensuring rapid triage, root cause analysis, and resolution.

Skills & Requirements

Must-have

  • Apigee API Implementations
  • production incident management
  • monitoring and alerting
  • Python or shell scripting
  • cloud infrastructure (GCP preferred)

Nice-to-have

  • proactive initiative and accuracy
  • managing multiple assignments
  • strong interpersonal skills
  • critical thinking ability
  • flexibility in adapting to needs

Key Requirements

  • 3+ years of experience in production support, SRE, or DevOps
  • Bachelor’s degree in Computer Science, Engineering, or related field
  • English B2-C1, Czech B1-B2 proficiency
  • Experience with CI/CD tools and Alerts/Monitoring automation
  • Familiarity with API integrations

Work Rights

Not specified

Tailored Resume

Cover Letter