Hyphen Connect is looking for a Synthetic Data Engineer to design and implement synthetic data generation pipelines, ensuring high-quality data management for model training. The ideal candidate will have extensive experience in building large-scale data pipelines and knowledge of prompt engineering and bias mitigation
Job Summary
The role involves designing domain-specific synthetic data generation pipelines using self-instruct and constitutional prompting techniques.
Candidates will implement automated quality scoring and de-duplication systems to ensure high-quality data management.
The position requires managing data pipelines that directly feed into SFT and DPO training loops within the organization.
Matching Summary
Match Score: 85
Hyphen Connect is looking for a Synthetic Data Engineer to design and implement synthetic data generation pipelines, ensuring high-quality data management for model training. The ideal candidate will have extensive experience in building large-scale data pipelines and knowledge of prompt engineering and bias mitigation.