Senior Site Reliability Engineer, Ai Research

Algolia

Remote, Australia
Remote
Cloud-first infrastructure
Production services on kubernetes
Infrastructure-as-code (terraform)
Algolia is seeking a Senior Site Reliability Engineer for its AI Research team to ensure the reliability and scalability of cloud infrastructure. This remote position emphasizes strong SRE fundamentals over AI-specific experience, focusing on collaboration with cross-functional teams to support innovative projects

Job Summary

  • Support and evolve the reliability of platforms used by the AI Research team, including production inference services and AI data feature stores.
  • Build and maintain Kubernetes-based services on GCP using infrastructure-as-code and GitOps, owning and improving CI/CD pipelines for Go and Python services.
  • This role offers high impact by enabling new AI-powered capabilities, high agency in shaping what and how it's built, and collaboration with experienced peers.

Matching Summary

Match Score: 85

Algolia is seeking a Senior Site Reliability Engineer for its AI Research team to ensure the reliability and scalability of cloud infrastructure. This remote position emphasizes strong SRE fundamentals over AI-specific experience, focusing on collaboration with cross-functional teams to support innovative projects.

Skills & Requirements

Must-have

  • cloud-first infrastructure
  • production services on Kubernetes
  • infrastructure-as-code (Terraform)
  • CI/CD systems
  • operating production services
  • Go programming language

Nice-to-have

  • supporting mission-critical internal platforms
  • research or experimentation-heavy environments
  • working alongside researchers
  • high ownership
  • ambiguity

Key Requirements

  • Strong experience operating cloud-first infrastructure
  • Hands-on experience running production services on Kubernetes
  • Proficiency with infrastructure-as-code (Terraform) and CI/CD systems
  • Experience supporting production services written in Go
  • Solid grounding in service reliability, incident response, and operational best practices
  • Comfort working in environments with ambiguity

Work Rights

Not specified

Tailored Resume

Cover Letter