Crunchyroll is seeking a Senior Reliability Engineer to join its new Partner Reliability Engineering team, which focuses on improving the reliability of the company's living room device ecosystem and payment systems. The ideal candidate will have extensive experience in site reliability engineering and automation, along with a passion for improving system observability.
Job Summary
Lead incident response, on-call support, and mitigation for device and payment-related issues in production environments.
Develop and evolve monitoring, alerting, and triage tooling to improve time to detection and resolution.
Work directly with external partners (e.g., Smart TV and device manufacturers, ISPs, payment providers) to investigate and resolve ecosystem issues.
Skills & Requirements
Must-have
Incident response and on-call support
Develop monitoring and alerting tools
Analyze data for anomalies and trends
Build automation and internal tools
Collaborate with internal teams
Work with external partners
Proficient in Python, Go, Java, or Node.js
Nice-to-have
Thrive in ambiguity
Automate repetitive work
Strong sense of ownership
Obsessed with root causes
Key Requirements
8+ years of practical experience
Experience in SRE, Support, or Incident Response roles
Hands-on experience with consumer devices
Familiar with production alerting and monitoring tools
Strong SQL skills
Experience with dashboards and analytics platforms
Familiar with ETL frameworks or data pipeline infrastructure