Data Engineer (Scala) - 131305

GFT Technologies Canada Inc

Apache Spark or PySpark experience
Scala programming language proficiency
AWS services (Glue, S3, EMR, Athena)

GFT Technologies Canada Inc is seeking a Data Engineer with expertise in distributed environments, specifically with skills in Scala and Apache Spark, to design and maintain scalable data pipelines. The role requires collaboration with multidisciplinary teams to ensure data quality and governance while supporting technical architectural decisions.

Job Summary

  • The role involves designing, building, and maintaining scalable and resilient data pipelines for large volumes of structured and unstructured data.
  • Candidates must ensure data quality, consistency, and governance while collaborating with software engineers and data scientists.
  • The ideal candidate embraces challenges, transforms ideas into creative solutions, and thrives in a collaborative, multidisciplinary environment.

Matching Summary

Match Score: 75

Skills & Requirements

Must-have

  • Apache Spark or PySpark experience
  • Scala programming language proficiency
  • AWS services (Glue, S3, EMR, Athena)
  • Distributed environment engineering
  • Batch and streaming data processing
  • Kafka or Amazon MSK familiarity

Nice-to-have

  • Delta Lake, Apache Hudi, or Iceberg knowledge
  • Legacy system modernization experience
  • CI/CD for data tools like dbt or Airflow
  • Collaborative team mindset
  • Independent time management skills
  • Creative problem-solving abilities

Key Requirements

  • Experience with distributed data environments
  • Proficiency in Scala and Python
  • Knowledge of relational and NoSQL databases
  • Familiarity with ETL/ELT pipeline modeling

Work Rights

Not specified
