Database Paper Browser

Back to papers

Quantifying TPC-H Choke Points and Their Optimizations

Summary: Systematic analysis of TPC-H choke points; quantifies impact of optimizations across eleven points. Highlights flattening subqueries and predicate placement as most impactful; plan choice drives Q2, Q17, Q21 while engine efficiency dominates Q1, Q13, Q18. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12037
Venue
VLDB
Year
2020
Pagerank
7.9068048e-05
Overall Rank
2,916 | 79.72%
DOI
10.14778/3389133.3389138

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank Citing Paper Year Venue Pagerank
3,668 The LDBC Social Network Benchmark: Business Intelligence Workload 2023 VLDB 6.8591612e-05
3,721 To Partition, or Not to Partition, That is the Join Question in a Real System 2021 SIGMOD 6.8179379e-05
4,281 Maximizing Persistent Memory Bandwidth Utilization for OLAP Workloads 2021 SIGMOD 6.2940039e-05
4,495 ClickHouse - Lightning Fast Analytics for Everyone 2024 VLDB 6.1410277e-05
4,704 JSON Tiles: Fast Analytics on Semi-Structured Data 2021 SIGMOD 5.9853687e-05
4,717 Cloud Analytics Benchmark 2023 VLDB 5.9751539e-05
5,530 Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling 2021 VLDB 5.4554282e-05
5,923 HyBench: A New Benchmark for HTAP Databases 2024 VLDB 5.2721765e-05
6,223 Distributed GPU Joins on Fast RDMA-capable Networks 2023 SIGMOD 5.1496398e-05
7,831 CUBIT: Concurrent Updatable Bitmap Indexing 2025 VLDB 4.6387445e-05
8,207 SQLStorm: Taking Database Benchmarking into the LLM Era 2025 VLDB 4.5583637e-05
8,415 Pruning in Snowflake: Working Smarter, Not Harder 2025 SIGMOD 4.5197687e-05
8,482 Cost Modelling for Optimal Data Placement in Heterogeneous Main Memory 2022 VLDB 4.5010191e-05
8,513 CXL Memory Performance for In-Memory Data Processing 2025 VLDB 4.4947795e-05
8,578 Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems 2022 VLDB 4.4923477e-05
8,645 Predicate Pushdown for Data Science Pipelines 2023 SIGMOD 4.4772518e-05
8,680 A Practical Approach to Groupjoin and Nested Aggregates 2021 VLDB 4.4694927e-05
8,720 Entropy-Learned Hashing: Constant Time Hashing with Controllable Uniformity 2022 SIGMOD 4.4609699e-05
8,884 Workload Insights From The Snowflake Data Cloud: What Do Production Analytic Queries Really Look Like? 2025 VLDB 4.4283999e-05
9,763 The UDFBench Benchmark for General-purpose UDF Queries 2025 VLDB 4.2856106e-05
10,105 RABIT: Efficient Range Queries with Bitmap Indexing 2026 SIGMOD 4.1945683e-05
10,243 TPCx-AI under the Microscope: A Benchmarking Debt Analysis 2026 VLDB 4.1945683e-05
10,652 The LDBC Financial Benchmark: Transaction Workload 2025 VLDB 4.1945683e-05
10,653 Alchemy: A Query Optimization Framework for Oblivious SQL 2025 VLDB 4.1945683e-05
10,707 PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking 2025 VLDB 4.1945683e-05
11,023 Window Function Expression: Let the Self-join Enter 2024 VLDB 4.1945683e-05
11,066 OLAP on Modern Chiplet-Based Processors 2024 VLDB 4.1945683e-05
11,415 Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications 2022 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 24 of 24 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
185 DuckDB: an Embeddable Analytical Database 2019 SIGMOD 0.00036538405
368 Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing 1998 VLDB 0.000254931
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
423 Measuring the Complexity of Join Enumeration in Query Optimization 1990 VLDB 0.00023669348
659 The Making of TPC-DS 2006 VLDB 0.00018500853
735 Umbra: A Disk-Based System with In-Memory Performance 2020 CIDR 0.00017452467
795 Conjunctive Selection Conditions in Main Memory 2002 PODS 0.00016600368
853 Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask 2018 VLDB 0.00015940507
1,057 Cosette: An Automated Prover for SQL 2017 CIDR 0.0001439886
1,432 An Empirical Evaluation of In-Memory Multi-Version Concurrency Control 2017 VLDB 0.00012017544
1,619 Adaptive Optimization of Very Large Join Queries 2018 SIGMOD 0.00011111678
1,826 Analysis of Two Existing and One New Dynamic Programming Algorithm for the Generation of Optimal Bushy Join Trees without Cross Products 2006 VLDB 0.00010400425
2,412 WideTable: An Accelerator for Analytical Data Processing 2014 VLDB 8.8726508e-05
2,504 Enhanced Subquery Optimizations in Oracle 2009 VLDB 8.6351917e-05
3,922 Pushing Data-Induced Predicates Through Joins in Big-Data Clusters 2020 VLDB 6.6291079e-05
3,951 Why You Should Run TPC-DS: A Workload Analysis 2007 VLDB 6.5953162e-05
4,680 To Share or Not to Share? 2007 VLDB 6.0039406e-05
4,956 Dimensions Based Data Clustering and Zone Maps 2017 VLDB 5.8040891e-05
6,114 Database Processing-in-Memory: An Experimental Study 2020 VLDB 5.204248e-05
7,053 Statisticum: Data Statistics Management in SAP HANA 2017 VLDB 4.8497195e-05
9,299 Engineering High-Performance Database Engines 2014 VLDB 4.3587894e-05
9,300 Keeping the TPC Relevant! 2013 VLDB 4.3587894e-05
Previous Page 1 / 1 Next

Semantically Similar Papers