Global Payments is seeking an AI Support Engineer to join their AI Operations team, responsible for monitoring and resolving production incidents for AI solutions. The role demands a strong technical background in AI systems, cloud infrastructure, and incident management
Job Summary
Serve as the first line of defense for production AI incidents, ensuring rapid triage, root cause analysis, and resolution.
Monitor system health and performance of deployed AI applications, agentic and RAG-based solutions, MCPs, and orchestration platforms.
Collaborate with AI and platform engineers to implement observability, logging, and alerting best practices for all AI services.
Matching Summary
Match Score: 85
Global Payments is seeking an AI Support Engineer to join their AI Operations team, responsible for monitoring and resolving production incidents for AI solutions. The role demands a strong technical background in AI systems, cloud infrastructure, and incident management.
Skills & Requirements
Must-have
production AI incident resolution
monitoring AI system health
troubleshooting model drift and hallucination
cloud infrastructure (AWS, GCP)
LLM and GenAI systems
Python or shell scripting
incident management skills
Nice-to-have
prompt engineering
AI governance and safety
reinforcement learning
big data technologies
CI/CD automation
Key Requirements
4+ years of experience in production support, SRE, or DevOps
1+ years of experience in AI/ML engineering
Bachelor’s degree in Computer Science, Engineering, or related field
Experience with modern orchestration and agentic frameworks
Hands-on experience with GCP Vertex AI, AWS Bedrock + Sagemaker
Availability for on-call rotation
Work Rights
Must be legally authorized to work for any employer in the United States