Database Paper Browser

Back to papers

From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System

Summary: Unifies communication-optimal distributed join evaluation with worst-case optimal sequential algorithms for cyclic joins on parallel architectures. Demonstrates practical optimizations and a unified evaluation of both approaches, enabling efficient, scalable join processing in DBMS. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5077
Venue
SIGMOD
Year
2015
Pagerank
0.00010025655
Overall Rank
1,939 | 86.52%
DOI
10.1145/2723372.2750545

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 30 of 30 citing papers.

Rank Citing Paper Year Venue Pagerank
342 EmptyHeaded: A Relational Engine for Graph Processing 2016 SIGMOD 0.00026795977
1,328 Hypertree Decompositions: Questions and Answers 2016 PODS 0.00012565612
1,333 Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins 2019 VLDB 0.00012523806
1,369 Random Sampling over Joins Revisited 2018 SIGMOD 0.00012339777
2,254 Two-Level Sampling for Join Size Estimation 2017 SIGMOD 9.1897043e-05
2,275 Adopting Worst-Case Optimal Joins in Relational Database Systems 2020 VLDB 9.1262202e-05
3,528 Distributed Data Deduplication 2016 VLDB 7.0066139e-05
3,729 Sortledton: a Universal, Transactional Graph Data Structure 2022 VLDB 6.8133526e-05
3,922 Pushing Data-Induced Predicates Through Joins in Big-Data Clusters 2020 VLDB 6.6291079e-05
3,982 The Myria Big Data Management and Analytics System and Cloud Service 2017 CIDR 6.5651188e-05
4,556 Distributed Subgraph Matching on Timely Dataflow 2019 VLDB 6.0883757e-05
6,619 Near-Optimal Distributed Band-Joins through Recursive Partitioning 2020 SIGMOD 4.9910152e-05
6,824 Computing Join Queries with Functional Dependencies 2016 PODS 4.9144789e-05
7,126 Debunking the Myth of Join Ordering: Toward Robust SQL Analytics 2025 SIGMOD 4.8232367e-05
7,153 Submodularity of Distributed Join Computation 2018 SIGMOD 4.8153963e-05
7,599 Quill: Efficient, Transferable, and Rich Analytics at Scale 2016 VLDB 4.7003593e-05
7,760 G-SQL: Fast Query Processing via Graph Exploration 2016 VLDB 4.6589413e-05
8,432 SPRINTER: A Fast n-ary Join Query Processing Method for Complex OLAP Queries 2020 SIGMOD 4.5153924e-05
8,530 HYPERSONIC: A Hybrid Parallelization Approach for Scalable Complex Event Processing 2022 SIGMOD 4.4937074e-05
9,082 JoinSketch: A Sketch Algorithm for Accurate and Unbiased Inner-Product Estimation 2023 SIGMOD 4.3998984e-05
9,327 SODA: A Set of Fast Oblivious Algorithms in Distributed Secure Data Analytics 2023 VLDB 4.3556432e-05
9,330 Parallel Query Processing: To Separate Communication from Computation 2022 SIGMOD 4.3556432e-05
10,238 TurboLynx: Schemaless Graph Engine Strikes Back for General-Purpose Analytics 2026 VLDB 4.1945683e-05
10,488 HoneyComb: A Parallel Worst-Case Optimal Join on Multicores 2025 SIGMOD 4.1945683e-05
10,514 cuMatch: A GPU-based Memory-Efficient Worst-case Optimal Join Processing Method for Subgraph Queries with Complex Patterns 2025 SIGMOD 4.1945683e-05
11,358 Scaling Equi-Joins 2022 SIGMOD 4.1945683e-05
11,831 Logical Aspects of Massively Parallel and Distributed Systems 2016 PODS 4.1945683e-05
11,835 An Efficient MapReduce Cube Algorithm for Varied Data Distributions 2016 SIGMOD 4.1945683e-05
11,882 Parallel Evaluation of Multi-Semi-Joins 2016 VLDB 4.1945683e-05
11,949 Big Data Research: Will Industry Solve all the Problems? 2015 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 22 of 22 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
99 On the Propagation of Errors in the Size of Join Results 1991 SIGMOD 0.00050022914
109 Dremel: Interactive Analysis of Web-Scale Datasets 2010 VLDB 0.00048186983
115 Eddies: Continuously Adaptive Query Processing 2000 SIGMOD 0.00046221215
157 HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads 2009 VLDB 0.00040397359
285 Automating Physical Database Design in a Parallel Database 2002 SIGMOD 0.0002899128
502 Worst-case Optimal Join Algorithms 2012 PODS 0.00021526612
542 Shark: SQL and Rich Analytics at Scale 2013 SIGMOD 0.00020595648
773 Multi-Dimensional Database Allocation for Parallel Data Warehouses 2000 VLDB 0.00016870159
906 F1: A Distributed SQL Database That Scales 2013 VLDB 0.00015448884
1,063 Tradeoffs in Processing Complex Join Queries via Hashing in Multiprocessor Database Machines 1990 VLDB 0.00014362773
1,110 Parallel Evaluation of Conjunctive Queries 2011 PODS 0.00013968198
1,411 Communication Steps for Parallel Query Processing 2013 PODS 0.0001212565
1,557 Beyond Worst-case Analysis for Joins with Minesweeper 2014 PODS 0.00011392493
2,044 Optimization of Multi-Way Join Queries for Parallel Execution 1991 VLDB 9.6953608e-05
2,212 Skew in Parallel Query Processing 2014 PODS 9.2771827e-05
2,413 Automated Partitioning Design in Parallel Database Systems 2011 SIGMOD 8.8672223e-05
2,526 Track Join: Distributed Joins with Minimal Network Traffic 2014 SIGMOD 8.5968612e-05
3,062 Efficient Multi-way Theta-Join Processing Using MapReduce 2012 VLDB 7.6343994e-05
3,377 Demonstration of the Myria Big Data Management Service 2014 SIGMOD 7.1624478e-05
3,382 Scalable and Adaptive Online Joins 2014 VLDB 7.1597145e-05
4,132 Advanced Join Strategies for Large-Scale Distributed Computation 2014 VLDB 6.4241067e-05
8,108 Execution Primitives for Scalable Joins and Aggregations in Map Reduce 2014 VLDB 4.5846987e-05
Previous Page 1 / 1 Next

Semantically Similar Papers