This role is responsible for designing and maintaining AI-ready data pipelines that power breakthrough discoveries and improve patient outcomes within Sanofi's regulated biopharma environment
Job Summary
This role is responsible for designing and maintaining AI-ready data pipelines that power breakthrough discoveries and improve patient outcomes within Sanofi's regulated biopharma environment.
The specialist will implement automated data preparation workflows using AI agent tooling to enable enterprise-wide AI adoption while ensuring strict adherence to GDPR, HIPAA, and other regulatory requirements.
Candidates will have the opportunity to lead the implementation of industry-leading AI platforms including DataOrion and Snowflake as part of Sanofi's transformation into an AI-first organization.
Matching Summary
This role is responsible for designing and maintaining AI-ready data pipelines that power breakthrough discoveries and improve patient outcomes within Sanofi's regulated biopharma environment.
Skills & Requirements
Must-have
5+ years data engineering experience
Cloud platforms Snowflake AWS Azure GCP
Data pipeline tools Apache Spark dbt Airflow
Informatica CDGC or similar governance tools
Python SQL proficiency
GDPR HIPAA compliance knowledge
Metadata management and data lineage
Nice-to-have
Generative AI and LLM experience
Pharmaceutical R&D domain knowledge
Unstructured data processing NLP
Real-time streaming architectures Kafka
AI governance frameworks understanding
Cloud platform certifications
Agentic AI systems experience
Key Requirements
Bachelor's degree in Computer Science or related field (Master's preferred)
5+ years hands-on experience in data engineering
Experience in pharmaceutical or regulated industry environments preferred
Proficiency in schema-as-code and automated catalog management