Site Reliability Engineer | Ai Infrastructure

Jll Com Sg

Tel Aviv, Israel
Base salary; rsus; annual bonus
Hybrid
Cloud platforms (azure or aws)
Containerization (docker, kubernetes)
Ci/cd pipelines
You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability

Job Summary

  • You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability.
  • The role involves designing deployment and observability for LLM-backed services, tracking output quality, cost per invocation, and model drift.
  • You will also write agent code in TypeScript and Python, work with data pipelines, and ship features alongside the team.

Matching Summary

You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability.

Salary

Base salary; RSUs; annual bonus

Skills & Requirements

Must-have

  • Cloud platforms (Azure or AWS)
  • Containerization (Docker, Kubernetes)
  • CI/CD pipelines
  • Infrastructure-as-code (Terraform, CDK, CloudFormation)
  • Monitoring and observability tools
  • Linux, networking, security fundamentals
  • Incident management experience

Nice-to-have

  • AI/ML infrastructure experience
  • Production code in TypeScript or Python
  • Self-service developer tooling
  • Cost optimization for cloud workloads
  • Enterprise security engineering

Key Requirements

  • 5+ years in SRE, platform engineering, DevOps, or infrastructure roles
  • Experience owning infrastructure end-to-end
  • Comfortable working independently with broad ownership and high accountability
  • Strong written and verbal English

Work Rights

Not specified

Tailored Resume

Cover Letter