Data Normalization

Data Preparation for Duplicate Detection

We propose the first workflow that systematically integrates data preparation operations before duplicate detection, improving AUC-PR by up to 19%.

Ioannis Koumarelas

Experience: Enhancing Address Matching with Geocoding and Similarity Measure Selection

In this paper, we study the problem of matching records that contain address information, including attributes such as Street-address and City. To facilitate this matching process …

Ioannis Koumarelas