Advisor - Scientific Data Engineer

Lilly

Base: $166,500 - $266,200; bonus/equity: compyny b...
8 years data engineering experience
Etl/elt pipeline development
Lakehouse architecture design
The role involves building an AI foundation that connects scientists to petabyte-scale data through natural language interfaces and automated workflows

Job Summary

  • The role involves building an AI foundation that connects scientists to petabyte-scale data through natural language interfaces and automated workflows.
  • You will design and maintain a semantic layer over multi-omics databases to enable AI systems and create gold-standard question/SQL pairs for training.
  • The position offers a comprehensive benefit program including medical, dental, vision, 401(k), and eligibility for a company bonus based on performance.

Matching Summary

The role involves building an AI foundation that connects scientists to petabyte-scale data through natural language interfaces and automated workflows.

Salary

Base: $166,500 - $266,200; Bonus/Equity: Company bonus depending on performance; Benefits: Medical, dental, vision, 401(k), vacation, wellness programs

Skills & Requirements

Must-have

  • 8 years data engineering experience
  • ETL/ELT pipeline development
  • Lakehouse architecture design
  • Semantic layer engineering
  • Vector embedding pipelines

Nice-to-have

  • PhD in data or related field
  • Experience with biomedical ontologies
  • Knowledge of knowledge graph technologies
  • Deep Databricks ecosystem expertise
  • Familiarity with Nextflow and Bioconductor

Key Requirements

  • Bachelors degree plus 8 years experience OR Masters plus 5 years
  • Strong SQL skills with complex relational schemas
  • Proficiency in Python for data processing
  • Experience with cloud platforms like AWS
  • Knowledge of data governance in regulated industries

Work Rights

Not specified

Tailored Resume

Cover Letter