Observability with OpenTelemetry, Prometheus, and Grafana
Job Summary
This role involves leading the design and operation of MLOps platforms to support critical business decisions and enhance patient experiences.
The team is at the forefront of Pfizer's transformation into a digitally driven organization using data science and AI to change patients' lives.
Candidates will own operational reliability for ML platforms, lead incident response, and continuously improve MLOps maturity using SRE-inspired practices.
Skills & Requirements
Must-have
MLOps platform execution and model operations
CI/CD pipelines for ML workloads
Observability with OpenTelemetry, Prometheus, and Grafana
AWS or Azure cloud-native environments
Model validation and regression testing
Nice-to-have
Experience with generative AI tools such as Copilot
Background in responsible AI governance
Master's degree in Computer Science or Data Science
Kubernetes certification (CKA or CKAD)
Curiosity for exploring new AI productivity tools
Key Requirements
BA/BS degree with 6+ years of experience
Strong hands-on experience operationalizing ML systems
Proficiency in Python, Bash, SQL, and ML frameworks
Demonstrated leadership abilities and cross-functional collaboration