The role involves designing and maintaining scalable data pipelines for the Net-Zero Data Public Utility to provide open public good data
Job Summary
The role involves designing and maintaining scalable data pipelines for the Net-Zero Data Public Utility to provide open public good data.
Candidates will be responsible for extracting, transforming, and validating data from diverse sources including APIs and structured files like Excel and Parquet.
The position requires implementing data quality checks and providing mentorship to junior team members while adhering to industry best practices.
Matching Summary
The role involves designing and maintaining scalable data pipelines for the Net-Zero Data Public Utility to provide open public good data.
Skills & Requirements
Must-have
7+ years of data engineering experience
Proficiency in Python programming language
Experience building and managing data pipelines
Knowledge of data warehousing and ETL processes
Ability to translate business logic into code
Nice-to-have
Mentorship and guidance for junior team members
Collaboration with data scientists and analysts
Experience with Pandas and Pydantic libraries
Strong documentation skills for data lineage
Adherence to CI/CD deployment practices
Key Requirements
7+ years of experience in data engineering
Proficiency in Python programming
Experience with data warehousing and ETL
Excellent problem-solving and communication skills