Job Summary
Define and evolve the end-to-end roadmap for multilingual data acquisition across modalities to train and evaluate large-scale AI models.
Collaborate with research teams to interpret model edge cases and failure modes, and feed those findings back into data collection as a continuous loop.
Establish data quality frameworks covering de-duplication, versioning, bias detection, and ethical filtering, and lead internal policies on data privacy, consent, and transparency.