Principal Engineer, Data Infrastructure

The New York Times

Remote, US
$198,000 — $220,000 usd py
**
Data lake on aws s3 with apache iceberg
Apache flink for stream processing
Aws glue (spark) for etl
** The New York Times is seeking a Principal Engineer for Data Infrastructure to lead the architecture and development of their data and machine learning systems. This remote position focuses on creating scalable, reliable, and efficient data processing and ML platforms while mentoring other engineers and collaborating across teams. **

Job Summary

  • Lead the architecture and evolution of our data and machine learning infrastructure, shaping the foundation for data-driven products, analytics, and AI applications.
  • Design systems that enable large-scale data processing, reliable pipelines, and efficient machine learning development, from feature engineering to real-time model serving.
  • Ensure that both data and ML platforms are scalable, reliable, cost-efficient, and compliant with privacy and governance standards.

Matching Summary

Match Score: 75

** The New York Times is seeking a Principal Engineer for Data Infrastructure to lead the architecture and development of their data and machine learning systems. This remote position focuses on creating scalable, reliable, and efficient data processing and ML platforms while mentoring other engineers and collaborating across teams. **

Salary

$198,000 — $220,000 USD

Skills & Requirements

Must-have

  • Data lake on AWS S3 with Apache Iceberg
  • Apache Flink for stream processing
  • AWS Glue (Spark) for ETL
  • dbt/Athena for analytical data models
  • Amazon DynamoDB for low-latency applications
  • Google BigQuery for analytics and BI
  • Confluent Kafka for real-time streaming
  • Fivetran for file and change-data ingestion

Nice-to-have

  • Familiarity with data lakehouse paradigm
  • Familiarity with medallion architecture
  • Experience with vector databases
  • Experience with distributed training frameworks
  • Experience with LLM stacks

Key Requirements

  • 10+ years of software engineering experience
  • Proven ability to influence technical direction
  • Proven expertise in data processing frameworks
  • Deep knowledge of ML infrastructure
  • Strong programming skills in Python and Java/Go
  • Experience designing scalable, reliable, cost-efficient systems
  • Cloud platform experience (AWS, GCP)
  • Familiarity with Kubernetes

Work Rights

Not specified

Tailored Resume

Cover Letter