IN_Senior Associate_PySpark Developer_Data & Analytics_Advisory_Kolkata

PwC UK

Kolkata, India
Onsite

Job Summary

  • At PwC, our people in data and analytics focus on leveraging data to drive insights and make informed business decisions.
  • You will develop and implement innovative solutions to optimise business performance and enhance competitive advantage.
  • We reward your contributions, support your wellbeing, and offer inclusive benefits, flexibility programmes and mentorship that will help you thrive in work and life.

Skills & Requirements

Must-have

  • Hands-on experience building data pipelines on Hadoop/Spark ecosystem
  • Strong Spark (Scala and/or PySpark), Hive/Impala SQL
  • Working knowledge of Kafka for streaming ingestion
  • Experience with Cloudera Manager, YARN/Tez, HDFS
  • Proficiency in Linux/Unix, Shell scripting, Git
  • Strong SQL and data modelling for BFSI use cases

Nice-to-have

  • Experience with Cloudera Data Platform (CDP)
  • Cloud data services – AWS, Azure, or GCP
  • Databricks experience
  • Containerization and orchestration (Docker/Kubernetes)
  • Monitoring/observability – Cloudera Manager metrics

Key Requirements

  • 4 to 7 years of experience
  • B.E./B.Tech/M.E./M.Tech
  • Bachelor of Engineering, Master of Business Administration, Bachelor of Technology

Work Rights

Not specified
