Site Reliability Engineer

ITCAN PTE. LIMITED

Singapore

Job Summary

  • The role requires engaging with product and development teams from the start of the SDLC to ensure reliability and resilience.
  • Candidates must analyze complex distributed systems, identify instability sources, and drive operational excellence through automation.
  • The position involves fixing code bugs in production and providing technical guidance to junior team members.

Skills & Requirements

Must-have

  • 5+ years of Java application experience
  • Production operations management
  • Log analysis and observability tools
  • Docker and Kubernetes knowledge
  • Full stack application knowledge

Nice-to-have

  • Strong leadership and work ethic
  • Excellent communication skills
  • Comfortable working in a fast-paced team environment
  • REST technologies expertise
  • Kafka and caching systems

Key Requirements

  • 5+ years of experience with Java applications
  • Experience handling production operations
  • Knowledge of SQL/NoSQL databases
  • Hands-on experience with Docker/Kubernetes
  • Proficiency with Grafana, Prometheus, or Splunk

Work Rights

Not specified
