o9 D&A – Data Lake (Raw/Domain)

Mondelez International

Mumbai, India
On-site
GCP data services (Dataflow, BigQuery, Dataproc, Pub/Sub)
Real-time streaming architectures
Data pipeline building and optimization

Job Summary

  • Support the day-to-day operations of GCP-based data pipelines, ensuring data governance, reliability, and performance optimization.
  • Assist in ensuring the robust operation of pipelines that translate varied inbound data into a standardized global design.
  • This role requires a flexible working schedule, including potential weekend support for critical operations, while maintaining a 40-hour work week.

Skills & Requirements

Must-have

  • GCP data services (Dataflow, BigQuery, Dataproc, Pub/Sub)
  • Real-time streaming architectures
  • Data pipeline building and optimization
  • Data cleaning, curation, and enrichment
  • ETL/ELT and SCD concepts
  • SQL, PL/SQL fluency
  • GCP cloud services experience

Nice-to-have

  • Experience with o9 global design
  • Talend ETL/Data integration tool experience
  • SAP BW/HANA integration
  • External data extraction via APIs
  • Collaboration using version control (GitHub)

Key Requirements

  • 6+ years overall industry experience
  • 6-8 years building large-scale data processing pipelines
  • Experience with data lakes, data warehouses, and data marts
  • Experience with data processing platforms such as Dataflow and Databricks
  • Experience orchestrating/scheduling data pipelines using Airflow and Alteryx

Work Rights

Not specified