Principal Scientist, Data Science – R&d Dsdh - Therapeutics Development & Supply (tds)
Johnson & Johnson Innovative Medicine
Spring House, PA, USA
Data pipelines for diverse data sources
Python, r, sql, cloud services
Data repositories and enterprise data models
Design, build, and optimize data capture, processing, and storage solutions for advanced analytics, digital process transformation, and AI/ML applications
Job Summary
Design, build, and optimize data capture, processing, and storage solutions for advanced analytics, digital process transformation, and AI/ML applications.
Partner with data scientists, domain experts, and digital technology teams to translate business needs into high-quality data products and engineering requirements.
This is a rare opportunity to grow in one of the world’s most ambitious and fast growing R&D Data Science organizations, shaping how Therapeutics Development & Supply data powers next-generation therapies.
Matching Summary
Design, build, and optimize data capture, processing, and storage solutions for advanced analytics, digital process transformation, and AI/ML applications.
Skills & Requirements
Must-have
Data pipelines for diverse data sources
Python, R, SQL, cloud services
Data repositories and enterprise data models
AI/ML data readiness
Data quality and performance standards
Software development best practices
Nice-to-have
Regulated data environments
High-dimensional data experience
MLOps and model deployment workflows
Manufacturing and laboratory systems knowledge
Knowledge graph/ontology architectures
Key Requirements
Advanced degree in Engineering, Data Science, Life Sciences, Computer Science, or related field
3+ years of experience in data engineering
Experience with NoSQL and graph databases
Strong analytical, problem-solving, and stakeholder-management skills