Head Of Ai Infrastructure & Machine Learning Operations
Apex Group
Multiple Locations
4 days onsite
Scalable ai runtime environment
Robust mlops stack
Secure, compliant ai development architectures
Apex Group is seeking a Head of AI Infrastructure & Machine Learning Operations to lead the development of scalable AI platforms and MLOps strategies within a highly regulated financial services environment. The role entails building secure infrastructures to support AI innovation and operational excellence, focusing on collaboration and compliance
Job Summary
Establish a scalable AI runtime environment to support rapid prototyping and early deployment of LLM agents and agentic workflows.
Design and implement a robust MLOps stack with model versioning, CI/CD pipelines, and automated monitoring for operational resilience.
Build secure, compliant AI development and deployment architectures while aligning with AI governance framework.
Matching Summary
Match Score: 85
Apex Group is seeking a Head of AI Infrastructure & Machine Learning Operations to lead the development of scalable AI platforms and MLOps strategies within a highly regulated financial services environment. The role entails building secure infrastructures to support AI innovation and operational excellence, focusing on collaboration and compliance.
Skills & Requirements
Must-have
scalable AI runtime environment
robust MLOps stack
secure, compliant AI development architectures
cross-functional collaboration
cloud-native or hybrid AI/ML platforms
end-to-end MLOps pipelines
model deployment and runtime management
monitoring, observability, and incident response
building secure, compliant AI infrastructure
Nice-to-have
strategic thinker
responsible AI passion
platform resilience
infrastructure innovation
high-ambiguity environments
collaborative leader
ethical technology adoption
Key Requirements
Proven leadership in designing scalable platforms
Deep expertise in MLOps strategy and execution
Hands-on experience with model deployment
Strong background in monitoring, observability, and incident response
Skilled in building secure, compliant AI infrastructure