Lead Engineer, Site Reliability Engineering

Mastercard

Site reliability engineering
Payments network sre
Monitoring and alerting
Mastercard's Program aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers by maintaining every aspect of our Programs infrastructure and technology ecosystem to the highest standards

Job Summary

  • Mastercard's Program aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers by maintaining every aspect of our Programs infrastructure and technology ecosystem to the highest standards.
  • In this role, you will join our Payments Network SRE team and take ownership of continuously assessing and elevating the end to end service quality of our platform.
  • Leverage automation and AI technologies to enhance proactive issue detection, enable self-healing capabilities, reducing Mean Time to Detect (MTTD) and Mean Time to Mitigate (MTTM).

Matching Summary

Mastercard's Program aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers by maintaining every aspect of our Programs infrastructure and technology ecosystem to the highest standards.

Skills & Requirements

Must-have

  • Site Reliability Engineering
  • Payments Network SRE
  • monitoring and alerting
  • capacity analysis
  • root cause analysis
  • packet level debugging
  • Infrastructure as Code tools

Nice-to-have

  • continuous learning and knowledge sharing
  • AI technologies
  • vendor hardware evaluation

Key Requirements

  • 5–10 years of experience in SRE
  • 3+ years supporting e-commerce, financial services, or large scale SaaS platforms
  • Excellent infrastructure troubleshooting skills
  • Strong hands on experience with observability and monitoring tools
  • Familiarity with network telemetry tools
  • Proficiency in packet level debugging
  • Broad understanding of end to end infrastructure
  • Experience with automation and Infrastructure as Code tools
  • Excellent communication skills
  • Demonstrated ability to troubleshoot complex production issues
  • Experience partnering with development teams
  • Strong understanding of monitoring and observability ecosystems
  • Effective incident management skills

Work Rights

Not specified

Tailored Resume

Cover Letter