Staff Software Engineer (python)

Duetto

United States
**
7+ years production data systems in python
Deep expertise in pyspark and distributed processing
Strong experience with iceberg lakehouse architectures
** Duetto is seeking a Staff Software Engineer specializing in Python to oversee its data infrastructure for real-time pricing solutions in the hospitality industry. The ideal candidate will have extensive experience with data systems, particularly in Python and PySpark, and will thrive in an AI-driven engineering culture. **

Job Summary

  • You will own the design, performance, and reliability of Duetto's data lakehouse while evolving the Python/PySpark pipeline framework.
  • The role involves architecting a shift from batch to near-real-time streaming using SQS-driven pipelines with Iceberg sinks.
  • Every engineer uses Claude Code and a custom multi-agent system daily as part of an AI-first engineering culture.

Matching Summary

Match Score: 75

** Duetto is seeking a Staff Software Engineer specializing in Python to oversee its data infrastructure for real-time pricing solutions in the hospitality industry. The ideal candidate will have extensive experience with data systems, particularly in Python and PySpark, and will thrive in an AI-driven engineering culture. **

Skills & Requirements

Must-have

  • 7+ years production data systems in Python
  • Deep expertise in PySpark and distributed processing
  • Strong experience with Iceberg lakehouse architectures
  • Production experience with Airflow workflow orchestrator
  • Solid AWS production experience across S3 Glue Athena

Nice-to-have

  • Working knowledge of Java for upstream systems
  • Experience with Trino or Presto for interactive SQL
  • Experience with dbt for data transformation modeling
  • Familiarity with Great Expectations data quality frameworks
  • Genuine interest in AI-assisted development tooling
  • Familiarity with hospitality reservation and rate data

Key Requirements

  • 7+ years building production data systems
  • Deep expertise in PySpark and distributed data processing
  • Strong experience with lakehouse architectures on S3
  • Production experience with Airflow or comparable orchestrator
  • Solid AWS production experience across core services

Work Rights

Not specified

Tailored Resume

Cover Letter