Mlops & Agentic Platform Engineer (ai Infrastructure)

Hyphen Connect

Australia, Australia
On-site
Manage model registries and continuous training loops
Deploy agents as scalable microservices on kubernetes
Build observability dashboards for token usage and latency
Hyphen Connect is seeking a skilled MLOps & Agentic Platform Engineer to manage model registries, develop continuous training loops, and implement A/B testing infrastructure. The ideal candidate will have a solid DevOps/MLOps background and experience in deploying scalable microservices, particularly using Kubernetes and related technologies

Job Summary

  • The role involves managing model registries, developing continuous training loops, and implementing A/B testing infrastructure.
  • Candidates will deploy AI agents as scalable microservices on Kubernetes while building comprehensive observability dashboards.
  • The ideal candidate possesses a strong DevOps/MLOps background and is adept at tracking token usage and agent reasoning paths.

Matching Summary

Match Score: 85

Hyphen Connect is seeking a skilled MLOps & Agentic Platform Engineer to manage model registries, develop continuous training loops, and implement A/B testing infrastructure. The ideal candidate will have a solid DevOps/MLOps background and experience in deploying scalable microservices, particularly using Kubernetes and related technologies.

Skills & Requirements

Must-have

  • Manage model registries and continuous training loops
  • Deploy agents as scalable microservices on Kubernetes
  • Build observability dashboards for token usage and latency

Nice-to-have

  • Strong DevOps background with Terraform experience
  • Knowledge of MLflow, Weights & Biases, or LangSmith
  • Experience building scalable microservice architectures

Key Requirements

  • Strong DevOps/MLOps background required
  • Experience with Kubernetes and Docker
  • Proficiency in Terraform for infrastructure

Work Rights

Not specified

Tailored Resume

Cover Letter