You'll help architect and deliver the internal frameworks that power data ingestion, transformation, observability, and self-serve analytics across the company
Job Summary
You'll help architect and deliver the internal frameworks that power data ingestion, transformation, observability, and self-serve analytics across the company.
Lead engineering efforts around CDC pipelines using Debezium, enabling reliable change capture from application databases.
Shape the developer experience for data engineers, from ingestion to publishing and consumption, with metadata-aware automation.
Matching Summary
You'll help architect and deliver the internal frameworks that power data ingestion, transformation, observability, and self-serve analytics across the company.
Skills & Requirements
Must-have
Data ingestion frameworks
Change Data Capture (CDC) pipelines
Trino serving layers
Apache Spark on AWS EMR
Python, Java or Scala proficiency
Open table formats (Delta Lake, Iceberg)
Nice-to-have
Platform-first thinker
Builder's bias
Deep collaborator
Operational discipline
Key Requirements
7+–12 years of experience
Strong systems engineering fundamentals
Production experience on AWS EMR
Experience building or managing CDC pipelines
Experience serving data using Trino or Presto
Proficiency in Python, Java or Scala
Exposure to infrastructure as code and CI/CD automation