In_senior Associate_ Pyspark Developer _data & Analytics _advisory _kolkjata

PwC

Kolkata, , India
Onsite
Hadoop/spark ecosystem data pipelines
Spark (scala and/or pyspark)
Hive/impala sql
Leverage data to drive insights and make informed business decisions using advanced analytics techniques

Job Summary

  • Leverage data to drive insights and make informed business decisions using advanced analytics techniques.
  • Develop and implement innovative solutions to optimize business performance and enhance competitive advantage.
  • Be part of a vibrant community of solvers that leads with trust and creates distinctive outcomes for our clients and communities.

Matching Summary

Leverage data to drive insights and make informed business decisions using advanced analytics techniques.

Skills & Requirements

Must-have

  • Hadoop/Spark ecosystem data pipelines
  • Spark (Scala and/or PySpark)
  • Hive/Impala SQL
  • Kafka for streaming ingestion
  • NiFi for batch/near-real-time flows
  • Cloudera Manager, YARN/Tez, HDFS
  • Oozie/Airflow job orchestration
  • Linux/Unix, Shell scripting, Git
  • CI/CD (Jenkins/GitLab CI)
  • SQL and data modelling for BFSI

Nice-to-have

  • Cloudera Data Platform (CDP)
  • Apache Ranger, Atlas, Kerberos
  • AWS, Azure, or GCP data services
  • Databricks experience
  • Containerization and orchestration
  • Monitoring/observability tools

Key Requirements

  • 4—7 Years of experience
  • B.E.(B.Tech)/M.E/M.Tech
  • Bachelor of Engineering, Master of Business Administration, Bachelor of Technology

Work Rights

Not specified

Tailored Resume

Cover Letter