MDedup: Duplicate Detection with Matching Dependencies
Our system uses automatically discovered MDs, dataset features, and known gold standards to train a model that selects MDs as duplicate detection rules.
Our system uses automatically discovered MDs, dataset features, and known gold standards to train a model that selects MDs as duplicate detection rules.
In this paper, we study the problem of matching records that contain address information, including attributes such as Street-address and City. To facilitate this matching process …