In_senior Associate_ Pyspark Developer _data & Analytics _advisory _kolkjata

PwC

Mumbai, India
Onsite
Hadoop/spark ecosystem data pipelines
Spark (scala/pyspark)
Hive/impala sql
At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities

Job Summary

  • At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities.
  • Develop and implement innovative solutions to optimise business performance and enhance competitive advantage.
  • We reward your contributions, support your wellbeing, and offer inclusive benefits, flexibility programmes and mentorship that will help you thrive in work and life.

Matching Summary

At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities.

Skills & Requirements

Must-have

  • Hadoop/Spark ecosystem data pipelines
  • Spark (Scala/PySpark)
  • Hive/Impala SQL
  • Kafka streaming ingestion
  • NiFi for batch flows
  • Cloudera Manager, YARN/Tez, HDFS
  • Oozie/Airflow job orchestration
  • Linux/Unix, Shell scripting, Git
  • CI/CD (Jenkins/GitLab CI)
  • SQL and data modelling for BFSI

Nice-to-have

  • Cloudera Data Platform (CDP)
  • Apache Ranger, Atlas, Kerberos
  • Cloud data services (AWS, Azure, GCP)
  • Databricks experience
  • Containerization (Docker/Kubernetes)
  • Scala build tools
  • Grafana/Prometheus monitoring

Key Requirements

  • 4—7 Years of experience
  • B.E.(B.Tech)/M.E/M.Tech
  • Bachelor of Engineering, Master of Business Administration, Bachelor of Technology

Work Rights

Not specified

Tailored Resume

Cover Letter