Data Preparation for Duplicate Detection
We propose the first workflow that systematically integrates data preparation operations before duplicate detection, improving AUC-PR by up to 19%.
In this paper, we study the problem of matching records that contain address information, including attributes such as Street-address and City. To facilitate this matching process …