Ai/ml Data Engineer | Unstructured Data, Pyspark, Python, Vector Search, Rag Architectures, Cloud (gcp/aws)

Synechron

Not specified
Python for data pipelines
Pyspark for distributed processing
Experience with unstructured data
Synechron is seeking an experienced AI/ML Data Engineer to design scalable data pipelines and integrate advanced language models for enterprise applications. The ideal candidate will have expertise in processing unstructured data, utilizing Python and PySpark, and experience with cloud platforms

Job Summary

  • Synechron is seeking an experienced AI/ML Data Engineer specialized in processing unstructured data.
  • The successful candidate will bridge data engineering and AI development to enable intelligent applications.
  • This role involves designing scalable data pipelines and supporting Retrieval-Augmented Generation architectures.

Matching Summary

Match Score: 85

Synechron is seeking an experienced AI/ML Data Engineer to design scalable data pipelines and integrate advanced language models for enterprise applications. The ideal candidate will have expertise in processing unstructured data, utilizing Python and PySpark, and experience with cloud platforms.

Skills & Requirements

Must-have

  • Python for data pipelines
  • PySpark for distributed processing
  • Experience with unstructured data

Nice-to-have

  • Cloud platforms like GCP or AWS
  • Data orchestration tools like Apache Airflow
  • Knowledge of data governance best practices

Key Requirements

  • Minimum of 6 years of experience
  • 2 years in unstructured data processing
  • Proven success in building scalable data pipelines

Work Rights

Not specified

Tailored Resume

Cover Letter