Sr Systems Reliability Engineer

Skywalker Sound

Nicasio, CA, United States
Base: $155,400 to $208,400 py; bonus/equity: may b...
3d onsite
Kubernetes cluster management
Ci/cd pipeline development
Aws infrastructure management
Design, manage and maintain critical infrastructure for both software development and deployed global production resources

Job Summary

  • Design, manage and maintain critical infrastructure for both software development and deployed global production resources.
  • Collaborate on the provisioning of cloud infrastructure in AWS using Terraform to ensure consistency and scalability.
  • Monitor, troubleshoot, and optimize build and deployment processes to maximize efficiency and minimize downtime.

Matching Summary

Design, manage and maintain critical infrastructure for both software development and deployed global production resources.

Salary

Base: $155,400 to $208,400 per year; Bonus/Equity: May be provided; Benefits: Full range of medical, financial, and/or other benefits

Skills & Requirements

Must-have

  • Kubernetes cluster management
  • CI/CD pipeline development
  • AWS infrastructure management
  • Infrastructure as Code (Terraform)
  • Observability tools (Datadog, Splunk)
  • Containerization (Docker, Kubernetes)

Nice-to-have

  • Media and entertainment pipelines
  • AI/ML framework integration
  • GitOps workflows
  • Serverless computing paradigms

Key Requirements

  • BS Degree in Computer Science
  • 5+ years of experience in SRE/DevOps
  • Extensive AWS knowledge
  • Proficiency with observability tools
  • Proficiency with GitLab CI, Terraform, Helm, Packer
  • Demonstrated CI/CD pipeline experience
  • In-depth knowledge of Containers and Orchestration
  • Strong scripting skills (Python, Bash)

Work Rights

Not specified

Tailored Resume

Cover Letter