Senior Software Engineer - Site Reliability Engineering (remote)

Home Depot

Fully remote
The Home Depot is seeking a Senior Software Engineer for Site Reliability Engineering to design and maintain internal platforms that enhance the reliability and observability of store systems. This remote role emphasizes building tools for operational automation, incident response, and performance testing, with a strong focus on cloud technologies and collaboration.

Job Summary

  • This role involves designing and maintaining the tools that hundreds of development and reliability teams depend on to keep store systems observable and reliable.
  • You will extend a custom-built synthetic testing system running inside physical Home Depot stores while managing the full observability stack including logging and tracing.
  • The position requires reducing operational toil through automation, including building Copilot skills and self-service capabilities for engineering teams.

Salary

Base: $90,000.00 - $180,000.00; Bonus/Equity: Not specified; Benefits: Not specified

Skills & Requirements

Must-have

  • Google Cloud Platform (GCP) experience
  • Kubernetes (GKE) deployment and management
  • Terraform infrastructure as code
  • Observability stack configuration
  • SLO and error budget management
  • Go, Python, or TypeScript proficiency

Nice-to-have

  • AI-assisted workflow development
  • Synthetic monitoring platform building
  • CDK8s programmatic IaC familiarity
  • Blameless post-mortem leadership
  • Self-service tooling creation

Key Requirements

  • 3-5 years of Site Reliability Engineering experience
  • Must be legally permitted to work in the United States
  • Bachelor's degree or equivalent in related field
