Solventum is seeking a Senior Data Engineer specializing in AI and ML enablement, focused on creating robust data pipelines and ensuring data quality. The position is remote and emphasizes collaboration with healthcare professionals, requiring strong technical skills in cloud environments and data processing frameworks
Job Summary
Solventum enables better, smarter, safer healthcare to improve lives.
As a Senior Data Engineer, you will design, build, and maintain robust ETL/ELT pipelines and support embedding generation for AI/GenAI use cases.
The company partners closely with the brightest minds in healthcare to ensure every solution melds the latest technology with compassion and empathy.
Matching Summary
Match Score: 85
Solventum is seeking a Senior Data Engineer specializing in AI and ML enablement, focused on creating robust data pipelines and ensuring data quality. The position is remote and emphasizes collaboration with healthcare professionals, requiring strong technical skills in cloud environments and data processing frameworks.
Skills & Requirements
Must-have
SQL, relational and NoSQL databases
large-scale data pipelines in cloud environments
Databricks or Snowflake and Spark
Python and data-frame libraries
ETL/workflow orchestration tools
data quality, validation, and lineage
Nice-to-have
experience in healthcare environments
supporting AI/ML use cases
data privacy, security, and compliance
Key Requirements
Bachelor's Degree or higher in Computer Science, Mathematics, engineering or a related technical field AND 6-8 years of job related experience OR High School Diploma/GED from AND 10 years of the same job-related experience
Hands-on experience building and maintaining large-scale data pipelines in cloud environments (Azure or AWS)
Experience with Databricks or Snowflake and distributed data processing frameworks (e.g., Spark)
Familiarity with feature stores, vector databases, or model-adjacent data systems
Proficiency in Python and experience with data-frame libraries (e.g., Pandas, Polars)
Experience with ETL/workflow orchestration tools (e.g., Databricks Workflows, Azure Data Factory, AWS Glue)