Job Summary
You will design and implement robust data pipelines, data ingestion, and data processing using open-source components such as Apache Airflow, Kafka, Spark, or Trino.
You will develop container-based analytics platforms and design efficient CI/CD pipelines based on Kubernetes for automated production deployments.
You will integrate data platforms into existing customer landscapes, including IAM, monitoring, and logging, contributing to stability, security, and transparency.
Skills & Requirements
Must-have
Apache Airflow, Kafka, Spark, Trino
Kubernetes container-based platforms
CI/CD pipelines with Kubernetes
Automated deployment with Helm and Argo CD
IAM, monitoring, and logging integration
Java, Scala, or Python programming
Infrastructure-as-Code concepts
Nice-to-have
Continuous learning and knowledge sharing
Critical evaluation of AI tool outputs
Customer-centric data sovereignty guidance
Key Requirements
Completed degree or comparable qualification
Practical experience with Spark, Kafka, Airflow
Practical Kubernetes experience
Solid programming skills in Java, Scala, or Python