Data Preparation for Duplicate Detection
We propose the first workflow that systematically integrates data preparation operations before duplicate detection, improving AUC-PR by up to 19%.
We propose the first workflow that systematically integrates data preparation operations before duplicate detection, improving AUC-PR by up to 19%.