Data Engineer, Data Center Capacity Delivery

Amazon

Seattle, WA, US
Not specified; not specified; not specified
On-site
Python scripting for etl pipelines
Aws glue and lambda services
Sql and spark development
The role involves developing and maintaining automated ETL pipelines using Python, Spark, SQL, and various AWS serverless components

Job Summary

  • The role involves developing and maintaining automated ETL pipelines using Python, Spark, SQL, and various AWS serverless components.
  • Candidates will work on a world-class data lake that drives multi-billion dollar decisions while aiming to democratize data access.
  • The position requires implementing data security solutions including encryption and user access controls for enterprise-scale implementations.

Matching Summary

The role involves developing and maintaining automated ETL pipelines using Python, Spark, SQL, and various AWS serverless components.

Salary

Not specified; Not specified; Not specified

Skills & Requirements

Must-have

  • Python scripting for ETL pipelines
  • AWS Glue and Lambda services
  • SQL and Spark development
  • Data warehouse optimization techniques
  • S3 data lake architecture

Nice-to-have

  • Experience with Redshift Spectrum
  • Knowledge of EMR and Kinesis
  • Strong software engineering best practices
  • Collaboration with diverse engineering teams
  • Passion for solving business problems with data

Key Requirements

  • Proficiency in distributed systems and data modeling
  • Experience with DDL and physical table optimization
  • Ability to gather requirements from internal business customers

Work Rights

Not specified

Tailored Resume

Cover Letter