Job Summary
At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities.
In this role, you will use data to drive insights and inform business decisions, applying advanced analytics techniques to help clients optimize their operations and achieve their strategic goals.
We reward your contributions, support your wellbeing, and offer inclusive benefits, flexibility programmes and mentorship that will help you thrive in work and life.
Skills & Requirements
Must-have
Hands-on experience building data pipelines
Strong Spark (Scala and/or PySpark)
Working knowledge of Kafka for streaming ingestion
Experience with Cloudera Manager, YARN/Tez
Proficiency in Linux/Unix, Shell scripting, Git
Strong SQL and data modelling for BFSI use cases
Ability to develop robust Spark jobs (batch and streaming)
Nice-to-have
Experience with Cloudera Data Platform (CDP)
Security and governance: Apache Ranger, Atlas
Cloud data services – AWS, Azure, or GCP
Databricks experience (Spark, Delta Lake)
Containerization and orchestration (Docker/Kubernetes)