Site Reliability Engineer | Ai Infrastructure

JLL

Tel Aviv, Israel
Base: not specified; bonus/equity: rsus (st + ard ...
Hybrid
Sre, platform engineering, devops, infrastructure
Azure or aws cloud platforms
Containerization (docker, kubernetes)
You will own the platform layer for AI agents, focusing on deployment architecture, observability, and production reliability

Job Summary

  • You will own the platform layer for AI agents, focusing on deployment architecture, observability, and production reliability.
  • The role involves designing deployment and observability for LLM-backed services, tracking output quality, cost per invocation, and model drift.
  • You will also write agent code in TypeScript and Python, work with data pipelines, and ship features alongside the team.

Matching Summary

You will own the platform layer for AI agents, focusing on deployment architecture, observability, and production reliability.

Salary

Base: Not specified; Bonus/Equity: RSUs (standard 4-year vest), annual bonus; Benefits: Keren hishtalmut

Skills & Requirements

Must-have

  • SRE, platform engineering, DevOps, infrastructure
  • Azure or AWS cloud platforms
  • Containerization (Docker, Kubernetes)
  • CI/CD pipelines
  • Infrastructure-as-code (Terraform, CDK, CloudFormation)
  • Monitoring and observability tools
  • Linux, networking, security fundamentals
  • Incident management experience

Nice-to-have

  • AI/ML infrastructure experience
  • Production code in TypeScript or Python
  • Self-service developer tooling
  • Cost optimization for cloud workloads
  • Enterprise security engineering

Key Requirements

  • 5+ years in SRE, platform engineering, DevOps, or infrastructure roles
  • Experience owning infrastructure end-to-end
  • Comfortable working independently with broad ownership

Work Rights

Not specified

Tailored Resume

Cover Letter