Machine Learning

MDedup: Duplicate Detection with Matching Dependencies featured image

MDedup: Duplicate Detection with Matching Dependencies

Our system uses automatically discovered MDs, dataset features, and known gold standards to train a model that selects MDs as duplicate detection rules.

avatar
Ioannis Koumarelas
Experience: Enhancing Address Matching with Geocoding and Similarity Measure Selection featured image

Experience: Enhancing Address Matching with Geocoding and Similarity Measure Selection

In this paper, we study the problem of matching records that contain address information, including attributes such as Street-address and City. To facilitate this matching process …

avatar
Ioannis Koumarelas