Database Paper Browser

A Practical Approach to Groupjoin and Nested Aggregates

Summary: The paper introduces two techniques: aggregate estimates for distributions, and parallel groupjoin execution that gives scalable, contention-free processing of a combined group-by and join. Together they improve estimation and planning for nested aggregates, with up to 2× speedups on TPC-H queries. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 12414
Venue: VLDB
Year: 2021
Pagerank: 4.4694927e-05
Overall Rank: 8,680 | 39.62%
DOI: 10.14778/3476249.3476288

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 39 of 39 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
35 | MonetDB/X100: Hyper-Pipelining Query Execution | 2005 | CIDR | 0.00076197749
44 | The Design Of Postgres | 1986 | SIGMOD | 0.00071838587
60 | Efficiently Compiling Efficient Query Plans for Modern Hardware | 2011 | VLDB | 0.00064439773
71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975
100 | Of Nests and Trees: A Unified Approach to Processing Queries That Contain Nested Subqueries, Aggregates, and Quantifiers | 1987 | VLDB | 0.00049624696
145 | Quickly Generating Billion-Record Synthetic Databases | 1994 | SIGMOD | 0.0004138408
204 | Learned Cardinalities: Estimating Correlated Joins with Deep Learning | 2019 | CIDR | 0.00034784455
241 | DB2 with BLU Acceleration: So Much More than Just a Column Store | 2013 | VLDB | 0.00031420034
248 | Eager Aggregation and Lazy Aggregation | 1995 | VLDB | 0.00030785339
351 | Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs | 2009 | VLDB | 0.0002636504
418 | Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age | 2014 | SIGMOD | 0.00023729211
608 | DeepDB: Learn from Data, not from Queries! | 2020 | VLDB | 0.00019235898
629 | Preventing Bad Plans by Bounding the Impact of Cardinality Estimation Errors | 2009 | VLDB | 0.00018942366
639 | Orthogonal Optimization of Subqueries and Aggregation | 2001 | SIGMOD | 0.00018791492
659 | The Making of TPC-DS | 2006 | VLDB | 0.00018500853
714 | Adaptive Aggregation on Chip Multiprocessors | 2007 | VLDB | 0.00017730584
735 | Umbra: A Disk-Based System with In-Memory Performance | 2020 | CIDR | 0.00017452467
990 | Improved Unnesting Algorithms for Join Aggregate SQL Queries | 1992 | VLDB | 0.00014809094
1,254 | Selectivity Estimation for Range Predicates using Lightweight Models | 2019 | VLDB | 0.00013027411
1,313 | Cost-Based Optimization for Magic: Algebra and Implementation | 1996 | SIGMOD | 0.0001263831
1,323 | Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters | 2016 | SIGMOD | 0.00012601997
1,582 | Execution Strategies for SQL Subqueries | 2007 | SIGMOD | 0.00011265079
1,804 | An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory | 2016 | SIGMOD | 0.00010501185
1,864 | Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last | 2018 | VLDB | 0.00010280966
2,504 | Enhanced Subquery Optimizations in Oracle | 2009 | VLDB | 8.6351917e-05
2,808 | A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries | 2001 | SIGMOD | 8.0870741e-05
2,916 | Quantifying TPC-H Choke Points and Their Optimizations | 2020 | VLDB | 7.9068048e-05
3,277 | A Layered Aggregate Engine for Analytics Workloads | 2019 | SIGMOD | 7.2871625e-05
3,702 | Every Row Counts: Combining Sketches and Sampling for Accurate Group-By Result Estimates | 2019 | CIDR | 6.8295759e-05
3,721 | To Partition, or Not to Partition, That is the Join Question in a Real System | 2021 | SIGMOD | 6.8179379e-05
5,087 | Accelerating Queries with Group-By and Join by Groupjoin | 2011 | VLDB | 5.7075009e-05
5,252 | Error-bounded Sampling for Analytics on Big Sparse Data | 2014 | VLDB | 5.6024389e-05
5,815 | StatAdvisor: Recommending Statistical Views | 2009 | VLDB | 5.3165295e-05
6,540 | Data Partitioning for In-Memory Systems: Myths, Challenges, and Opportunities | 2019 | CIDR | 5.0219214e-05
6,672 | Optimization of Nested Queries using the NF2 Algebra | 2016 | SIGMOD | 4.9669223e-05
7,658 | PgCuckoo: Laying Plan Eggs in PostgreSQL's Nest | 2019 | SIGMOD | 4.6869093e-05
8,051 | Building Advanced SQL Analytics From Low-Level Plan Operators | 2021 | SIGMOD | 4.5969549e-05
8,997 | Chasing Similarity: Distribution-aware Aggregation Scheduling | 2019 | VLDB | 4.4120041e-05
9,861 | Bridging the Chasm between Science and Reality | 2021 | CIDR | 4.2689311e-05

Semantically Similar Papers