Sre, platform engineering, devops, or infrastructure roles
Cloud platforms (azure or aws)
Containerization (docker, kubernetes)
You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability, in a greenfield environment with real constraints
Job Summary
You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability, in a greenfield environment with real constraints.
The role involves building monitoring and observability for AI services, tracking output quality, cost per invocation, and model drift, while ensuring sensitive data flows meet audit requirements.
This position offers enterprise complexity with startup autonomy, requiring you to write agent code in TypeScript and Python, work with data pipelines, and ship features alongside the team.
Matching Summary
You will own the platform layer for AI agents, including deployment architecture, observability, and production reliability, in a greenfield environment with real constraints.
Skills & Requirements
Must-have
SRE, platform engineering, DevOps, or infrastructure roles