Efficient Processing of k Nearest Neighbor Joins using MapReduce
Summary: A kNN join on MapReduce: mappers cluster objects into groups, and reducers perform the kNN join on each group. Distance-based pruning and two approximate strategies for minimizing replicas reduce shuffling and computation costs, yielding scalable, robust performance on large clusters. (summarized by gpt-5-nano on Feb 09 2026)
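The map/reduce split described in the summary can be illustrated with a small single-process sketch. It is not the paper's implementation: the function names (`map_phase`, `knn_join`), the hand-picked pivots, and the particular triangle-inequality bound used for pruning are all assumptions made for illustration, and the two approximate replica-minimization strategies are not modeled.

```python
import heapq
import math
from collections import defaultdict


def dist(a, b):
    """Euclidean distance between two points given as tuples."""
    return math.dist(a, b)


def map_phase(points, pivots):
    """'Mapper' step (assumed): assign each point to its nearest pivot."""
    groups = defaultdict(list)
    for p in points:
        cell = min(range(len(pivots)), key=lambda i: dist(p, pivots[i]))
        groups[cell].append(p)
    return groups


def knn_join(R, S, k, pivots):
    """For every r in R, return its k nearest neighbors in S."""
    r_groups = map_phase(R, pivots)
    s_groups = map_phase(S, pivots)
    result = {}
    # 'Reducer' step (assumed): one kNN join per group of R.
    for i, r_part in r_groups.items():
        s_local = s_groups.get(i, [])
        if len(s_local) >= k:
            # theta: upper bound on the k-th neighbor distance of any r in
            # this group, obtained from the S objects in the same cell.
            theta = max(
                heapq.nsmallest(k, (dist(r, s) for s in s_local))[-1]
                for r in r_part
            )
            radius = max(dist(r, pivots[i]) for r in r_part)
            # Triangle inequality: a true neighbor s of some r satisfies
            # dist(s, pivot) <= dist(s, r) + dist(r, pivot) <= theta + radius.
            candidates = [s for s in S if dist(s, pivots[i]) <= radius + theta]
        else:
            # Too few local S objects to derive a bound: no pruning.
            candidates = S
        for r in r_part:
            result[r] = heapq.nsmallest(k, candidates, key=lambda s: dist(r, s))
    return result
```

In this sketch, `candidates` plays the role of the S objects that would be shuffled to a reducer; a tighter bound means fewer replicas. Calling `knn_join(R, S, 3, pivots)` with lists of coordinate tuples returns a dictionary mapping each object of `R` to its three nearest objects of `S`.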
Authors
1. Wei Lu
2. Yanyan Shen
3. Su Chen
4. Beng Chin Ooi
Incoming Citations (Sorted by Pagerank)
Showing 17 of 17 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636 |
| 161 | LOF: Identifying Density-Based Local Outliers | 2000 | SIGMOD | 0.00039846974 |
| 447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171 |
| 774 | Algorithms for Mining Distance-Based Outliers in Large Datasets | 1998 | VLDB | 0.00016865771 |
| 1,074 | Processing Theta-Joins using MapReduce | 2011 | SIGMOD | 0.00014260096 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 1,715 | V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors | 2012 | VLDB | 0.00010803271 |
| 3,300 | Indexing the Distance: An Efficient Method to KNN Processing | 2001 | VLDB | 7.2516103e-05 |
| 5,636 | GORDER: An Efficient Method for KNN Join Processing | 2004 | VLDB | 5.3981191e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,021 | Parallel Algorithms for Constructing Range and Nearest-Neighbor Searching Data Structures | 2016 | PODS | 6.5225987e-05 |
| 1,074 | Processing Theta-Joins using MapReduce | 2011 | SIGMOD | 0.00014260096 |
| 4,090 | Finding Near Neighbors Through Cluster Pruning | 2007 | PODS | 6.4577834e-05 |
| 5,636 | GORDER: An Efficient Method for KNN Join Processing | 2004 | VLDB | 5.3981191e-05 |
| 4,147 | Exploiting MapReduce-based Similarity Joins | 2012 | SIGMOD | 6.4096022e-05 |
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 4,775 | Set Similarity Joins on MapReduce: An Experimental Survey | 2018 | VLDB | 5.9315784e-05 |
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171 |
| 3,141 | ClusterJoin: A Similarity Joins Framework using Map-Reduce | 2014 | VLDB | 7.4829448e-05 |