Binary Theta-Joins using MapReduce: Efficiency Analysis and Improvements

Jan 1, 2014·
Ioannis Koumarelas
Ioannis Koumarelas
,
Athanasios Naskos
,
Anastasios Gounaris
· 1 min read
Abstract
We deal with binary theta-joins in a MapReduce environment, and we make two contributions. First, we show that the best known algorithm to date for this problem can reach the optimal trade-off between the size of the input a reducer can receive and the incurred communication cost when the join selectivity is high. Second, when the join selectivity is low, we present improvements upon the state-of-the-art with a view to decreasing the communication cost and the maximum load a reducer can receive, taking also into account the load imbalance across the reducers.
Type
Publication
In EDBT/ICDT 2014 Joint Conference
Note

Click the Cite button above to enable visitors to import publication metadata into their reference management software.

Note

Create your slides in Markdown - click the Slides button to check out the example.

Add supplementary notes, full text, or examples here. You can include code, math, and images.