Job Summary
You will be responsible for designing and maintaining optimal data pipeline architecture to ensure efficient, scalable, and reliable data flow across the organization.
You will also build and optimize the infrastructure for extracting, transforming, and loading (ETL/ELT) data from diverse sources, using SQL, Python, distributed data processing frameworks, cloud data platforms, and cloud services.
We take care of you, so you can take care of business, providing everything you and your career need to thrive at S&P Global.
Skills & Requirements
Must-have
Python
SQL
Cloud data platforms
AWS or other cloud environments
Distributed data processing frameworks
CI/CD platforms
RESTful services
Nice-to-have
System design concepts
Object-oriented programming principles
Agile environments
Cross-functional collaboration
Key Requirements
Minimum 5 years of experience in a data engineering role
Hands-on experience with Databricks
Experience with workflow orchestration/scheduling platforms
Experience with version control systems
Experience maintaining and developing software in production environments
Experience with modern Python API frameworks
Strong understanding of big data / distributed systems technologies