5+ years professional experience in data engineering
2+ years focused on ml/ai data infrastructure
Advanced proficiency in python and scala programming
This role involves leading the development of data infrastructure specifically supporting Agentic AI initiatives within IQVIA's R&DS AI Innovation Program
Job Summary
This role involves leading the development of data infrastructure specifically supporting Agentic AI initiatives within IQVIA's R&DS AI Innovation Program.
The successful candidate will collaborate with AI scientists and engineers to architect robust data pipelines that power autonomous AI agents.
Key responsibilities include implementing data governance, security measures, and observability frameworks to ensure high-quality training data and compliance.
Matching Summary
This role involves leading the development of data infrastructure specifically supporting Agentic AI initiatives within IQVIA's R&DS AI Innovation Program.
Skills & Requirements
Must-have
5+ years professional experience in data engineering
2+ years focused on ML/AI data infrastructure
Advanced proficiency in Python and Scala programming
Expert-level knowledge of SQL and NoSQL databases
Hands-on experience with vector databases like Pinecone or Weaviate
Proficiency with modern data orchestration platforms like Airflow 2.x
Extensive experience with major cloud platforms AWS Azure or GCP
Nice-to-have
Experience with Rust Go Java or Julia programming languages
Knowledge of LLM fine-tuning data requirements and processing
Experience developing data systems for autonomous AI agents
Familiarity with RAG systems and related data pipelines
Understanding of RLHF data workflows and semantic caching
Experience mentoring junior engineers and establishing best practices
Working knowledge of ML frameworks like PyTorch or TensorFlow
Key Requirements
Bachelor's or Master's degree in Computer Science or Data Engineering
5+ years of professional experience in data engineering
At least 2 years focused on ML/AI data infrastructure