Big Data Engineer

techcompaniesportugal.fyi

Prague, Czech Republic
Python and PySpark
Databricks or similar platform
Apache Spark and distributed systems

The Big Data Engineer position at PwC in Prague and Brno involves designing, developing, and deploying data pipelines using Databricks/Spark, focusing on optimizing processing in distributed clusters. The ideal candidate should possess advanced knowledge of Python, experience with Databricks or similar platforms, and familiarity with cloud platforms such as Azure, AWS, or GCP.

Job Summary

  • Our agile Digital Enablement team focuses on Big Data, Data Science, and Data Analytics for clients globally, developing data and technology solutions for better business decisions and cost optimization.
  • Your role will involve designing, developing, and deploying data pipelines in Databricks/Spark, optimizing distributed cluster processing, and collaborating on data models and integration solutions.
  • We offer concentrated experience, rapid career growth, fair salary plus overtime, flexible working hours, generous paid time off, a comprehensive benefit program, and extensive development and training opportunities.

Skills & Requirements

Must-have

  • Python and PySpark
  • Databricks or similar platform
  • Apache Spark and distributed systems
  • Relational databases and PL/SQL
  • Cloud platforms (Azure, AWS, GCP)
  • Object-oriented programming (OOP) principles
  • Data models (DWH, ODS)

Nice-to-have

  • Apache Kafka streaming
  • Data quality tools
  • Oracle DB development
  • Spark job tuning

Key Requirements

  • Advanced Python and PySpark experience
  • Experience with Databricks or Cloudera
  • Understanding of Apache Spark
  • Experience with relational databases
  • Cloud platform experience
  • Object-oriented programming knowledge
  • Data model design experience
  • Data platform architecture knowledge
  • Czech language (excellent)
  • English language (communicative)

Work Rights

Not specified