Senior Site Reliability Engineer

Centific Global

Not specified (assumed to be flexible based on the role's technical nature)
5+ years sre or devops experience
Hands-on observability tools like prometheus
Strong incident management and rca skills
Centific Global is seeking a Senior Site Reliability Engineer with expertise in observability and DevOps to enhance the reliability and operational excellence of their cloud-native platform on Microsoft Azure. The ideal candidate should have at least five years of experience in SRE or related roles, along with hands-on experience using observability tools and a strong problem-solving mindset

Job Summary

  • Centific is a frontier AI data foundry empowering enterprise clients with safe, scalable AI deployment using purpose-built technology platforms.
  • The role focuses on improving system reliability, monitoring, and operational excellence while partnering closely with Product and Engineering teams.
  • Candidates must have strong ownership and problem-solving skills with a passion for reliability, observability, and automation.

Matching Summary

Match Score: 85

Centific Global is seeking a Senior Site Reliability Engineer with expertise in observability and DevOps to enhance the reliability and operational excellence of their cloud-native platform on Microsoft Azure. The ideal candidate should have at least five years of experience in SRE or related roles, along with hands-on experience using observability tools and a strong problem-solving mindset.

Skills & Requirements

Must-have

  • 5+ years SRE or DevOps experience
  • Hands-on observability tools like Prometheus
  • Strong incident management and RCA skills
  • Proficiency in Python, Bash, or PowerShell
  • Experience with Azure services including AKS

Nice-to-have

  • Cloud architecture design and security best practices
  • Performance tuning and capacity planning expertise
  • Ability to debug application code
  • Cost optimization experience
  • Collaboration across distributed global teams

Key Requirements

  • 5+ years of SRE, DevOps, or Production Engineering experience
  • Hands-on experience with Azure Monitor and OpenTelemetry
  • Proven track record in production incident response

Work Rights

Not specified

Tailored Resume

Cover Letter