Engineer - PySpark/AWS

Barclays

Pune, India
PySpark distributed data processing
AWS data platforms
Advanced SQL query writing

Job Summary

  • Build and maintain the systems that collect, store, process, and analyse data, such as data pipelines, data warehouses and data lakes to ensure that all data is accurate, accessible, and secure.
  • Design and implement data warehouses and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures.
  • Collaborate with data scientists to build and deploy machine learning models.

Matching Summary

Build and maintain the systems that collect, store, process, and analyse data, such as data pipelines, data warehouses and data lakes to ensure that all data is accurate, accessible, and secure.

Skills & Requirements

Must-have

  • PySpark distributed data processing
  • AWS data platforms
  • Advanced SQL query writing
  • Hadoop big data tools
  • Linux Shell Scripting automation

Nice-to-have

  • Agile project management
  • Continuous improvement mindset
  • Collaboration with data scientists
  • Risk and controls management
  • Stakeholder management

Key Requirements

  • AWS Certification
  • Familiarity with Scrum and Agile methodology

Work Rights

Not specified
