Binary Theta-Joins using MapReduce: Efficiency Analysis and Improvements
Jan 1, 2014
Ioannis Koumarelas
Athanasios Naskos
Anastasios Gounaris

Abstract
We deal with binary theta-joins in a MapReduce environment, and we make two contributions. First, we show that the best known algorithm to date for this problem can reach the optimal trade-off between the size of the input a reducer can receive and the incurred communication cost when the join selectivity is high. Second, when the join selectivity is low, we present improvements upon the state-of-the-art with a view to decreasing the communication cost and the maximum load a reducer can receive, taking also into account the load imbalance across the reducers.
Type
Publication
In EDBT/ICDT 2014 Joint Conference
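As a rough illustration of the trade-off the abstract refers to, the sketch below simulates a generic grid partitioning of the join matrix on a single machine. It is not the algorithm analysed in the paper: the function `grid_theta_join`, its parameters `rows` and `cols`, and the round-robin band assignment are all assumptions made for this example. Each tuple of one input is replicated along a row of reducers and each tuple of the other along a column, so every candidate pair meets at exactly one reducer. Using more reducers lowers the input each one receives but raises the total number of tuples shipped.

```python
from collections import defaultdict
from itertools import product
import operator


def grid_theta_join(S, T, theta, rows, cols):
    """Simulate a theta-join of S and T on a hypothetical rows x cols reducer grid.

    Returns (matching pairs, communication cost, maximum reducer input).
    """
    # "Map" phase: each reducer (r, c) collects a list of S tuples and a list of T tuples.
    reducer_inputs = defaultdict(lambda: ([], []))
    for i, s in enumerate(S):
        r = i % rows  # assumption: round-robin row band; a random choice would also work
        for c in range(cols):  # replicate s to every reducer in its row
            reducer_inputs[(r, c)][0].append(s)
    for j, t in enumerate(T):
        c = j % cols  # assumption: round-robin column band
        for r in range(rows):  # replicate t to every reducer in its column
            reducer_inputs[(r, c)][1].append(t)

    # "Reduce" phase: each reducer checks the predicate on its local pairs only.
    output = []
    for s_part, t_part in reducer_inputs.values():
        output.extend((s, t) for s, t in product(s_part, t_part) if theta(s, t))

    # Communication cost: total tuples shipped to reducers, replicas included.
    communication_cost = sum(len(s) + len(t) for s, t in reducer_inputs.values())
    # Maximum load: the largest input any single reducer receives.
    max_reducer_input = max(len(s) + len(t) for s, t in reducer_inputs.values())
    return output, communication_cost, max_reducer_input


# Example: inequality join s < t on a 2 x 3 grid of reducers.
pairs, cost, max_load = grid_theta_join(range(8), range(6), operator.lt, rows=2, cols=3)
```

In this simulation the communication cost is `cols * |S| + rows * |T|`, while each reducer receives roughly `|S| / rows + |T| / cols` tuples, which makes the tension between the two quantities easy to see for different grid shapes.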