Senior Data Scientist, Product Data

Impact

Cape Town, South Africa
On-site
Product data quality
Classification modeling
Deduplication and entity resolution
Impact.com is seeking a Senior Data Scientist specializing in Product Data Quality to enhance the quality of product data across their platform. The role involves building and maintaining scalable data pipelines and ML models to improve catalog hygiene and transaction accuracy

Job Summary

  • Own the analytical and technical foundation of product data quality across the impact.com ecosystem.
  • Develop, deploy, and maintain ML models for automated product categorization and taxonomy assignment.
  • Take models and analytics prototypes from POC to production, owning deployment, testing, monitoring, and iteration.

Matching Summary

Match Score: 85

Impact.com is seeking a Senior Data Scientist specializing in Product Data Quality to enhance the quality of product data across their platform. The role involves building and maintaining scalable data pipelines and ML models to improve catalog hygiene and transaction accuracy.

Skills & Requirements

Must-have

  • Product data quality
  • Classification modeling
  • Deduplication and entity resolution
  • Search and retrieval infrastructure
  • Python and SQL proficiency
  • ML model deployment

Nice-to-have

  • Vector database infrastructure
  • Product graph infrastructure
  • Collaboration with category experts
  • Manufacturer data quality evaluation

Key Requirements

  • 5+ years in data science, ML engineering, or analytics engineering
  • 2+ years focused on product data, catalog quality, entity resolution, search/retrieval, or e-commerce/marketplace analytics
  • Proven ability to build production-grade data pipelines and deploy ML models independently
  • Strong software engineering fundamentals
  • Demonstrated experience analyzing and improving large-scale structured data quality
  • Track record building and deploying classification models, ranking systems, or search/retrieval pipelines in production
  • Proficiency with ML libraries (scikit-learn, XGBoost, LightGBM, PyTorch/TensorFlow)

Work Rights

Not specified

Tailored Resume

Cover Letter