Big Data / PySpark Engineering Lead - Vice President

Citigroup

Pune, Maharashtra, India
Citi is seeking a Big Data / PySpark Engineering Lead - Vice President for its Pune location to lead the development and implementation of application systems and data processing pipelines. The ideal candidate has extensive experience in big data ecosystems, particularly Spark and Hadoop, along with strong leadership and problem-solving skills.

Job Summary

  • This senior role leads the strategic migration of data and logic from legacy platforms to a modern Data Lakehouse environment at Citigroup.
  • The successful candidate will design and implement scalable, fault-tolerant batch and real-time data processing pipelines using PySpark and distributed frameworks.
  • Candidates must provide technical mentorship, conduct code reviews, and translate complex business requirements into robust technical specifications.

Matching Summary

Match Score: 75

Salary

Not specified

Skills & Requirements

Must-have

  • 12+ years of software building experience
  • Expert Python programming skills
  • Hadoop, Spark, Hive, and Kafka ecosystem
  • Legacy system migration leadership
  • SQL query optimization expertise
  • Unix shell scripting proficiency

Nice-to-have

  • Mindset of using AI tools for automation
  • Root-cause approach to problem solving
  • Ability to explain technical decisions
  • Experience with Trino/Starburst query engines
  • Collibra or Informatica data lineage tools

Key Requirements

  • 12+ years of software building experience
  • High-volume data processing pipeline development
  • Reverse engineering of legacy SQL scripts
  • Strong computer science fundamentals

Work Rights

Not specified