Ioannis Koumarelas, PhD
Ioannis Koumarelas, PhD
Home
Skills
Experience
Accomplishments
Publications
Contact
Light
Dark
Automatic
1
MDedup: Duplicate Detection with Matching Dependencies
Our system uses automatically discovered MDs, various dataset features, and known gold standards to train a model that selects MDs as duplicate detection rules. Once trained, the model can select useful MDs for duplicate detection on any new dataset.
Ioannis Koumarelas
,
Thorsten Papenbrock
,
Felix Naumann
PDF
Cite
Code
Video
Supplementary code utilities
Repeatability
Efficient Discovery of Matching Dependencies
We focus on the efficient discovery of all interesting MDs in real-world datasets. For this purpose, we propose HyMD, a novel MD discovery algorithm that finds all minimal, non-trivial MDs within given similarity boundaries.
Philipp Schirmer
,
Thorsten Papenbrock
,
Ioannis Koumarelas
,
Felix Naumann
PDF
Cite
Towards Progressive Search-driven Entity Resolution
This paper describes a first step to solve the problem of progressive search-driven Entity Resolution
:
resolving all the entities described by a user through a handful of keywords, progressively (according to an order by clause)
Alberto Pietrangelo
,
Giovanni Simonini
,
Sonia Bergamaschi
,
Felix Naumann
,
Ioannis Koumarelas
PDF
Cite
Combination of Rule-based and Textual Similarity Approaches to Match Financial Entities
Matching financial entities (FEs) is important for many private and governmental organizations. In this paper we describe the problem of matching such FEs across three datasets
:
FFIEC, LEI and SEC. We were able to achieve an f-measure of 93.78% in the first task, which is comparable to the maximum 97.44%, and 70.44% for the second task, where the maximum is 88.38%.
Ahmad Samiei
,
Ioannis Koumarelas
,
Michael Loster
,
Felix Naumann
PDF
Cite
Binary Theta-Joins using MapReduce: Efficiency Analysis and Improvements
We deal with binary theta-joins in a MapReduce environment, and we make two contributions. First, we show that the best known algorithm …
Ioannis Koumarelas
,
Athanasios Naskos
,
Anastasios Gounaris
PDF
Cite
Code
Cite
×