Site Reliability Engineer | AI Infrastructure

JLL

Tel Aviv, Israel
Hybrid

Job Summary

  • You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability.
  • You will design deployment and observability for LLM-backed services, including tracking of output quality, cost per invocation, and model drift.
  • You will also write agent code in TypeScript and Python, work with data pipelines, and ship features alongside the team.

Salary

Base: Not specified; Bonus/Equity: RSUs (standard 4-year vest), annual bonus; Benefits: Keren Hishtalmut (study fund)

Skills & Requirements

Must-have

  • SRE, platform engineering, DevOps, infrastructure
  • Cloud platforms (Azure or AWS)
  • Containerization (Docker, Kubernetes)
  • CI/CD pipelines
  • Infrastructure-as-code (Terraform, CDK, CloudFormation)
  • Monitoring and observability tools
  • Linux, networking, security fundamentals
  • Incident management and on-call experience

Nice-to-have

  • AI/ML infrastructure experience
  • Production code in TypeScript or Python
  • Self-service developer tooling
  • Cloud and API cost optimization
  • Enterprise security engineering

Key Requirements

  • 5+ years in SRE, platform engineering, DevOps, or infrastructure
  • Experience owning infrastructure end-to-end
  • Comfortable working independently with broad ownership

Work Rights

Not specified
