Data Engineer

Cohere Health

Hyderabad, India
On-site
4-5 years data engineering experience
Python sql pyspark/spark proficiency
Aws s3 glue athena experience
The role involves developing batch data pipelines and building reusable ingestion workflows on an AWS-based lakehouse platform

Job Summary

  • The role involves developing batch data pipelines and building reusable ingestion workflows on an AWS-based lakehouse platform.
  • Cohere Health delivers AI-powered solutions to streamline healthcare access and has been named to the 2025 Inc. 5000 list.
  • Candidates will collaborate with senior engineers and analytics teams to deliver production-grade data solutions for operational use cases.

Matching Summary

The role involves developing batch data pipelines and building reusable ingestion workflows on an AWS-based lakehouse platform.

Skills & Requirements

Must-have

  • 4-5 years data engineering experience
  • Python SQL PySpark/Spark proficiency
  • AWS S3 Glue Athena experience
  • DBT and Airflow workflow development
  • Apache Iceberg dataset management

Nice-to-have

  • Schema evolution and partition optimization
  • Athena query performance troubleshooting
  • Production issue resolution skills
  • Empathetic and candid team collaboration

Key Requirements

  • 4–5 years of data engineering experience
  • Strong hands-on experience with Python and SQL
  • Experience with AWS services including S3, Glue, and Athena
  • Familiarity with Apache Iceberg or similar table formats

Work Rights

Not specified

Tailored Resume

Cover Letter