Data Engineering

ioh

Indonesia
On-site
Build and manage data pipelines
Etl processes for indonesian text data
Data cleaning and pre-processing
As a Data Engineer, you will play a crucial role in building and managing the data pipelines that are essential for training and fine-tuning our Large Language Models (LLMs), with a specific focus on the Indonesian language

Job Summary

  • As a Data Engineer, you will play a crucial role in building and managing the data pipelines that are essential for training and fine-tuning our Large Language Models (LLMs), with a specific focus on the Indonesian language.
  • You will be responsible for designing, building, and maintaining a robust and scalable data infrastructure, collaborating closely with Data Scientists and Machine Learning Engineers.
  • Key responsibilities include designing, developing, and maintaining ETL processes, gathering and integrating complex datasets, performing data cleaning and pre-processing, and ensuring data quality for AI model development.

Matching Summary

As a Data Engineer, you will play a crucial role in building and managing the data pipelines that are essential for training and fine-tuning our Large Language Models (LLMs), with a specific focus on the Indonesian language.

Skills & Requirements

Must-have

  • Build and manage data pipelines
  • ETL processes for Indonesian text data
  • Data cleaning and pre-processing
  • Design scalable data architecture
  • Ensure data integrity and accuracy
  • Python, SQL, and Scala proficiency
  • Big data technologies (Spark, Hadoop, Kafka)

Nice-to-have

  • Cloud platform experience (AWS, GCP, Azure)
  • DevOps/DataOps principles
  • Strong analytical and problem-solving abilities
  • Excellent communication and teamwork skills

Key Requirements

  • 3-5 years of hands-on data engineering experience
  • Experience with big data and machine learning projects
  • Bachelor’s degree in Computer Science or related field

Work Rights

Not specified

Tailored Resume

Cover Letter