This position supervises and participates in the development of batch and real-time data pipelines utilizing various data analytics processing frameworks
Job Summary
This position supervises and participates in the development of batch and real-time data pipelines utilizing various data analytics processing frameworks.
The role involves performing extract, transform, load (ETL) data conversions and facilitating data cleansing and enrichment for internal and external sources.
Candidates must possess a strong understanding of the data life cycle stages including collection, transformation, secure storage, and accessibility.
Matching Summary
This position supervises and participates in the development of batch and real-time data pipelines utilizing various data analytics processing frameworks.
Skills & Requirements
Must-have
Python Java Scala C++ programming
Batch and real-time data pipeline development
ETL tool capabilities and data cleansing
Cloud services platform knowledge AWS Azure GCP
Database systems and data warehousing solutions
Nice-to-have
Strong foundational knowledge in software development
Experience with analytical reporting tools PBI Looker Qlik
Ability to synthesize disparate data sources into reusable assets
Understanding of distributed systems and business problems
Capacity to guide team members on data analysis findings
Key Requirements
Bachelor's degree in MIS mathematics statistics or computer science
Equivalent job experience in lieu of degree
Literate in programming languages for statistical modeling and analysis