Senior Software Engineer, Reliability Engineering

Airbnb

São Paulo, Brazil
On-site

Job Summary

  • As a Senior Software Engineer in Production SRE, you will be responsible for developing and maintaining the tools and systems that enable our engineering teams to operate our services reliably and at scale.
  • Your tooling and systems expertise will be instrumental in strengthening the reliability of our services and in improving how the company manages incidents.
  • As an essential part of this role, you will also serve as an active member of the Production SRE team, responding to and managing high-severity incidents.

Skills & Requirements

Must-have

  • Develop and maintain reliability tools
  • Incident response and management
  • Large-scale distributed systems
  • Cloud computing platforms
  • Containerization technologies

Nice-to-have

  • Foster a culture of reliability
  • Continuous learning and improvement
  • Blameless post-mortems
  • Mentoring less experienced engineers

Key Requirements

  • 5+ years of experience in software engineering or SRE roles
  • Bachelor's degree in Computer Science or related field
  • Strong coding skills in Java, Python, or Go
  • Experience with distributed systems and service-oriented architectures
  • Experience with AWS or Google Cloud Platform
  • Experience with Docker and Kubernetes
  • Fluent in English (Professional Level)

Work Rights

Not specified
