Site Reliability Engineer

Pendo.io

Herzliya, IL, United States
On-site
Infrastructure-as-code automation
Ci/cd pipeline management
Production incident management
Pendo.io is seeking a Site Reliability Engineer to enhance their cloud infrastructure management, focusing on reliability, performance, and cost-efficiency. The role involves collaboration with developers and product managers to automate and maintain systems while ensuring high service availability

Job Summary

  • The SRE team is responsible for provisioning and maintaining cloud infrastructure from development through production, ensuring reliability, performance, and cost-efficiency.
  • Responsibilities include writing infrastructure-as-code, developing maintainable code with an operations focus, debugging production issues, and participating in a 24x7 on-call rotation.
  • Pendo is a fast-growing startup with a passionate, dynamic, and fun culture, offering experience in diverse technologies and a real impact on the company's future.

Matching Summary

Match Score: 85

Pendo.io is seeking a Site Reliability Engineer to enhance their cloud infrastructure management, focusing on reliability, performance, and cost-efficiency. The role involves collaboration with developers and product managers to automate and maintain systems while ensuring high service availability.

Skills & Requirements

Must-have

  • Infrastructure-as-code automation
  • CI/CD pipeline management
  • Production incident management
  • Kubernetes (GKE) expertise
  • Go or Python programming
  • Distributed systems design
  • Service Level Objectives (SLOs)

Nice-to-have

  • Cost-efficiency in cloud infrastructure
  • Security and compliance collaboration
  • Proactive capacity planning
  • On-call rotation participation
  • Runbook automation

Key Requirements

  • Experience with Ansible or Terraform
  • Experience with Kubernetes in production
  • Experience as SRE or DevOps Engineer

Work Rights

Not specified

Tailored Resume

Cover Letter