Senior AI Data Engineer

IQVIA

Multiple Locations
Hybrid

Job Summary

  • Lead the development and optimization of data infrastructure supporting Agentic AI initiatives.
  • Collaborate with ML engineers, AI scientists, and product managers to architect, implement, and maintain robust data pipelines powering autonomous AI agents.
  • Drive data platform reliability, scalability, and cost optimization across cloud-based infrastructure.

Skills & Requirements

Must-have

  • Scalable data pipelines and ETL
  • Data models for analytics and ML
  • Data governance and security
  • Observability and monitoring frameworks
  • Distributed computing frameworks (Spark, Dask, Ray)
  • Streaming technologies (Kafka, Flink)
  • Cloud-based infrastructure (AWS, Azure, GCP)

Nice-to-have

  • Resilient data architectures for AI agents
  • Real-time agent feedback and telemetry
  • Vector databases and knowledge graphs
  • Automated data validation and quality checks
  • Reducing latency for agent decision-making
  • LLM fine-tuning data requirements
  • RAG systems and related data pipelines

Key Requirements

  • 5+ years of professional experience in data engineering
  • 2+ years focused on ML/AI data infrastructure
  • Advanced proficiency in Python and Scala
  • Expert-level knowledge of SQL and NoSQL databases
  • Hands-on experience with vector databases
  • Proficiency with modern data orchestration platforms
  • Extensive experience with major cloud platforms
  • Expertise in containerization and orchestration
  • Experience with Infrastructure as Code tooling
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or related field

Work Rights

Not specified