Data Engineer Intern

GoodRx

San Francisco, CA, US
Aws, kubernetes, airflow, redshift, emr
Event bridge, kinesis, aws lambda, s3, glue
Python, pyspark, active mq, kafka, kinesis
Collaborate with product managers, data scientists, data analysts and engineers to define requirements and data specifications

Job Summary

  • Collaborate with product managers, data scientists, data analysts and engineers to define requirements and data specifications.
  • Develop, deploy and maintain data processing pipelines using cloud technology such as AWS, Kubernetes, Airflow, Redshift, EMR.
  • We’re committed to growing and empowering a more inclusive community within our company and industry.

Matching Summary

Collaborate with product managers, data scientists, data analysts and engineers to define requirements and data specifications.

Skills & Requirements

Must-have

  • AWS, Kubernetes, Airflow, Redshift, EMR
  • Event Bridge, Kinesis, AWS Lambda, S3, Glue
  • Python, pySpark, Active MQ, Kafka, Kinesis
  • Complex SQL and ETL development
  • Data quality processes and validation
  • Full lifecycle deployments with testing

Nice-to-have

  • Innately curious and organized
  • Quickly learn complex domains
  • Contribute to inclusive community

Key Requirements

  • Bachelor’s degree in analytics, statistics, engineering, math, economics, science or related discipline
  • Experience in Cloud data spaces (AWS, Azure or GCP)
  • Experience engineering data pipelines with big data technologies
  • Experience writing complex SQL and ETL development
  • Demonstrated ability to analyze large data sets
  • Familiarity with AWS Services
  • Experience using Jira, GitHub, Docker, CodeFresh, Terraform
  • Experience with data quality processes

Work Rights

Not specified

Tailored Resume

Cover Letter