In_senior Associate_ Pyspark Developer _data & Analytics _advisory _kolkjata

techcompaniesportugal.fyi

Kolkata, India
Onsite
Hadoop/spark ecosystem data pipelines
Spark (scala/pyspark), hive/impala sql
Kafka for streaming ingestion
At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities

Job Summary

  • At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities.
  • Develop and implement innovative solutions to optimize business performance and enhance competitive advantage.
  • We reward your contributions, support your wellbeing, and offer inclusive benefits, flexibility programmes and mentorship that will help you thrive in work and life.

Matching Summary

At PwC, you will be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities.

Skills & Requirements

Must-have

  • Hadoop/Spark ecosystem data pipelines
  • Spark (Scala/PySpark), Hive/Impala SQL
  • Kafka for streaming ingestion
  • NiFi for batch/near-real-time flows
  • Cloudera Manager, YARN/Tez, HDFS
  • Job orchestration using Oozie/Airflow
  • Data warehousing concepts
  • Linux/Unix, Shell scripting, Git
  • CI/CD (Jenkins/GitLab CI)
  • SQL and data modelling for BFSI

Nice-to-have

  • Cloudera Data Platform (CDP)
  • Apache Ranger, Atlas; Kerberos
  • Cloud data services (AWS, Azure, GCP)
  • Databricks experience
  • Containerization (Docker/Kubernetes)
  • Python for data processing
  • Monitoring/observability tools

Key Requirements

  • 4-7 Years of experience
  • B.E.(B.Tech)/M.E/M.Tech
  • Bachelor of Engineering, Master of Business Administration, Bachelor of Technology

Work Rights

Not specified

Tailored Resume

Cover Letter