Database Paper Browser

Exploiting MapReduce-based Similarity Joins

Summary: MRSimJoin is a multi-round MapReduce algorithm for similarity joins in cloud systems. It partitions the data until each subset is small enough to be processed on a single node. The algorithm applies to metric-space data, is built on Hadoop, has real-world uses (image features, bibliographic data), and is evaluated in EC2-scale experiments. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4584
Venue: SIGMOD
Year: 2012
Pagerank: 6.4096022e-05
Overall Rank: 4,147 | 71.16%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 8 of 8 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 2 of 2 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262
447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171

Semantically Similar Papers