Senior Data Engineer (Python, PySpark, Kafka)

Bosch Rexroth Pty. Ltd.

Bangalore, India
On-site

Job Summary

  • The role involves building real-time data streaming pipelines using Apache Kafka Connect and Kafka Streams.
  • Candidates will develop high-performance data transformation jobs on large-scale datasets using PySpark.
  • This position is based at Bosch Global Software Technologies, the largest software development center of Bosch outside Germany.

Skills & Requirements

Must-have

  • Strong proficiency in Python for data manipulation
  • Experience with PySpark for large-scale datasets
  • Hands-on experience with Apache Kafka and streaming
  • Knowledge of Hadoop ecosystem including HDFS and Hive
  • Proficiency in SQL and relational databases

Nice-to-have

  • Familiarity with dimensional modeling concepts
  • Experience with version control systems like Git
  • Exposure to Flink or Spark Streaming frameworks

Key Requirements

  • B.Tech or M.Tech degree required
  • About 6 years of relevant experience (inferred from the senior title, not stated explicitly)
  • No specific work authorization requirements mentioned

Work Rights

Not specified
