Senior Backend Engineer, Data Modeling And Ingestion Platform

Udio

New York, United States
On-site

Job Summary

  • Build high-throughput bulk ingestion workflows to integrate datasets from multiple external providers.
  • Design and implement scalable entity-resolution solutions, including record linking, deduplication, clustering, and conflict arbitration.
  • Define and track data quality indicators, such as overlap metrics, match precision/recall, duplicate rates, and completeness.

Salary

Base: $180,000 - $220,000; Equity: Not specified; Benefits: Highly competitive

Skills & Requirements

Must-have

  • large, heterogeneous datasets
  • entity resolution
  • Python for data processing
  • Dataflow/Beam
  • BigQuery
  • bulk ingestion workflows
  • matching logic and decision rules

Nice-to-have

  • Google Cloud Platform architecture
  • distributed compute frameworks
  • JAX-based ML pipelines
  • multihost training setups
  • TFRecords
  • ranking, clustering, similarity modeling

Key Requirements

  • Experience with large, heterogeneous datasets
  • Strong background in entity resolution
  • Proficiency in Python
  • Experience with BigQuery, Google Dataflow/Apache Beam
  • Familiarity with data validation, normalization, and reconciliation
  • Ability to craft well-structured matching and decision strategies
  • Comfortable iterating quickly on pragmatic solutions

Work Rights

Not specified
