Data Steward Engineer

Lilly

Indianapolis, Indiana, US
Azure databricks platform expertise
Pharma manufacturing data systems
Data profiling and quality assessment
You will serve as the technical and functional data steward for our pharma manufacturing data analytics platform, built on Azure Databricks

Job Summary

  • You will serve as the technical and functional data steward for our pharma manufacturing data analytics platform, built on Azure Databricks.
  • You will conduct systematic data profiling, data quality assessments, and root cause analysis across Azure Databricks data assets, identifying and resolving data anomalies, gaps, duplicates, and inconsistencies at the source and in the data pipeline.
  • By collaborating with source system owners and IT teams, you will trace data provenance, map data flows from source systems into the Databricks Lakehouse, and document transformation logic to support data traceability for GMP compliance.

Matching Summary

You will serve as the technical and functional data steward for our pharma manufacturing data analytics platform, built on Azure Databricks.

Skills & Requirements

Must-have

  • Azure Databricks platform expertise
  • Pharma manufacturing data systems
  • Data profiling and quality assessment
  • Data cleansing and transformation pipelines
  • Data lineage and metadata documentation
  • GMP data integrity principles

Nice-to-have

  • Subject matter expertise in manufacturing data
  • Bridging technical and business language
  • Collaboration with data governance
  • Proactive data quality monitoring

Key Requirements

  • 6+ years in data engineering, stewardship, or management
  • 5+ years with manufacturing data systems (MES, LIMS, ERP, PI)
  • 3+ years with Azure Databricks or comparable cloud lakehouse
  • Bachelor's degree or equivalent practical experience
  • Proficiency in Python and SQL

Work Rights

Not specified

Tailored Resume

Cover Letter