Efficient Multi-way Theta-Join Processing Using MapReduce
Summary: Cost-based scheduling for multi-way Theta-joins in a shared-nothing MapReduce paradigm; metrics for single and multi-job plans. Introduces a chain-typed Theta-join that can be done in one MapReduce job, beating Pig/Hive solutions. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xiaofei Zhang
- 2. Lei Chen
- 3. Min Wang
Incoming Citations (Sorted by Pagerank)
Showing 12 of 12 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 335 | Optimization of Real Conjunctive Queries | 1993 | PODS | 0.00027036073 |
| 447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171 |
| 794 | Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) | 2010 | VLDB | 0.00016605103 |
| 947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576 |
| 1,074 | Processing Theta-Joins using MapReduce* | 2011 | SIGMOD | 0.00014260096 |
| 1,499 | Apache Hadoop Goes Realtime at Facebook | 2011 | SIGMOD | 0.00011675192 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 3,539 | Scheduling Shared Scans of Large Data Files | 2008 | VLDB | 6.9956521e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,797 | Runtime Optimization of Join Location in Parallel Data Management Systems | 2017 | VLDB | 4.1945683e-05 |
| 3,898 | Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment | 2021 | VLDB | 6.6551268e-05 |
| 447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171 |
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 2,044 | Optimization of Multi-Way Join Queries for Parallel Execution | 1991 | VLDB | 9.6953608e-05 |
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 11,890 | Let's Rethink Join Optimization in Distributed Systems | 2015 | CIDR | 4.1945683e-05 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 1,074 | Processing Theta-Joins using MapReduce* | 2011 | SIGMOD | 0.00014260096 |