Senior Data Engineer Python/gcp (x/f/m)

Doctolib GmbH

Paris, France
On-site
5+ years data engineering experience
Google cloud platform (gcp) ecosystem
Python and sql proficiency
The role focuses on building scalable data pipelines on Google Cloud Platform to power AI Medical Companion models including LLMs and VLMs

Job Summary

  • The role focuses on building scalable data pipelines on Google Cloud Platform to power AI Medical Companion models including LLMs and VLMs.
  • You will architect NoSQL and Vector Databases to efficiently store embeddings and documents for retrieval systems.
  • Your work directly supports health professionals in delivering better care to millions of patients across Europe.

Matching Summary

The role focuses on building scalable data pipelines on Google Cloud Platform to power AI Medical Companion models including LLMs and VLMs.

Skills & Requirements

Must-have

  • 5+ years Data Engineering experience
  • Google Cloud Platform (GCP) ecosystem
  • Python and SQL proficiency
  • NoSQL and Vector Database architecture
  • RAG and embedding pipeline design

Nice-to-have

  • Master's or Ph.D. in Computer Science
  • Experience with healthcare data compliance
  • Fluent in English language
  • Knowledge of data governance frameworks

Key Requirements

  • 5+ years of Data Engineering experience
  • Strong background in AI or ML workloads
  • Deep understanding of NoSQL systems like MongoDB
  • Experience designing data architectures for RAG
  • Knowledge of data governance and security

Work Rights

Not specified

Tailored Resume

Cover Letter