Database Paper Browser

FactorJoin: A New Cardinality Estimation Framework for Join Queries

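Summary: FactorJoin combines the efficiency of histogram-based estimation with the accuracy of learned correlation models. It builds single-table distributions offline and models a join query as a factor graph over them, so it can estimate join cardinalities without denormalizing tables or training on a query workload. It has a small footprint, with about 40x lower estimation latency and a 100x smaller model than prior learned estimators. (summarized by gpt-5-nano on Feb 09 2026)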
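
As a rough illustration of how per-table join-key histograms can give a join-size estimate without denormalizing the tables, the sketch below bins the join keys of two tables and bounds the join size bin by bin. It is a simplified stand-in, not the paper's factor-graph inference: the function names (`bin_stats`, `join_upper_bound`, `exact_join_size`), the modulo binning, and the integer keys are all assumptions made for this example.

```python
from collections import Counter


def bin_stats(keys, num_bins):
    """Offline step: per-bin total row count and most-frequent-value count.

    Assumes integer join keys and bins them by `key % num_bins`
    (a hypothetical binning rule chosen only for illustration).
    """
    totals = [0] * num_bins
    max_freq = [0] * num_bins
    for value, freq in Counter(keys).items():
        b = value % num_bins
        totals[b] += freq
        max_freq[b] = max(max_freq[b], freq)
    return totals, max_freq


def join_upper_bound(stats_a, stats_b):
    """Upper bound on |A JOIN B| using only the per-bin statistics.

    Within one bin, every row of A matches at most max_freq_b rows of B
    (and vice versa), so the bin's join size is at most
    min(total_a * max_freq_b, total_b * max_freq_a).
    """
    totals_a, max_a = stats_a
    totals_b, max_b = stats_b
    return sum(
        min(ta * mb, tb * ma)
        for ta, ma, tb, mb in zip(totals_a, max_a, totals_b, max_b)
    )


def exact_join_size(keys_a, keys_b):
    """Ground truth for comparison: sum over keys of freq_a * freq_b."""
    count_a, count_b = Counter(keys_a), Counter(keys_b)
    return sum(count_a[v] * count_b[v] for v in count_a)


keys_a = [1, 1, 2, 3, 4, 4, 4]
keys_b = [1, 2, 2, 4, 5]

coarse = join_upper_bound(bin_stats(keys_a, 2), bin_stats(keys_b, 2))
fine = join_upper_bound(bin_stats(keys_a, 10), bin_stats(keys_b, 10))
truth = exact_join_size(keys_a, keys_b)
print(coarse, fine, truth)
```

With more bins the bound tightens toward the true join size, at the cost of larger per-table statistics; that trade-off between footprint and accuracy is the one the summary alludes to.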

Paper ID: 6544
Venue: SIGMOD
Year: 2023
Pagerank: 6.5581983e-05
Overall Rank: 3,990 | 72.25%
DOI: 10.1145/3588721

Incoming Citations (Sorted by Pagerank)

Showing 32 of 32 citing papers.

Rank Citing Paper Year Venue Pagerank
4,417 Robust Query Driven Cardinality Estimation under Changing Workloads 2023 VLDB 6.2037371e-05
5,401 ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads 2024 VLDB 5.5285035e-05
5,832 Stage: Query Execution Time Prediction in Amazon Redshift 2024 SIGMOD 5.3111109e-05
6,383 Sample-Efficient Cardinality Estimation Using Geometric Deep Learning 2024 VLDB 5.0884322e-05
6,898 Disclosure-Compliant Query Answering 2024 SIGMOD 4.8925595e-05
6,969 LpBound: Pessimistic Cardinality Estimation using ℓp-Norms of Degree Sequences 2025 SIGMOD 4.8799937e-05
7,123 ASM: Harmonizing Autoregressive Model, Sampling, and Multi-dimensional Statistics Merging for Cardinality Estimation 2024 SIGMOD 4.8251036e-05
7,990 Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD 2024 VLDB 4.6117441e-05
8,834 ByteCard: Enhancing ByteDance’s Data Warehouse with Learned Cardinality Estimation 2024 SIGMOD 4.4394021e-05
9,230 LeaFi: Data Series Indexes on Steroids with Learned Filters 2025 SIGMOD 4.3690661e-05
9,317 Are Joins over LSM-trees Ready? Take RocksDB as an Example 2025 VLDB 4.3556432e-05
9,485 Spatial Query Optimization With Learning 2024 VLDB 4.3341665e-05
9,621 ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation 2023 VLDB 4.3167167e-05
9,812 A Practical Theory of Generalization in Selectivity Learning 2025 VLDB 4.2783272e-05
9,825 Athena: An Effective Learning-based Framework for Query Optimizer Performance Improvement 2025 SIGMOD 4.2751057e-05
9,845 Path-centric Cardinality Estimation for Subgraph Matching 2025 VLDB 4.2721228e-05
9,852 Machine Unlearning in Learned Databases: An Experimental Analysis 2024 SIGMOD 4.2714575e-05
9,878 PRICE: A Pretrained Model for Cross-Database Cardinality Estimation 2025 VLDB 4.2656547e-05
9,917 Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes 2023 VLDB 4.2561557e-05
9,960 An Elephant Under The Microscope: Analyzing The Interaction Of Optimizer Components In PostgreSQL 2025 SIGMOD 4.2294678e-05
9,988 I Can't Believe It's Not Yannakakis: Pragmatic Bitmap Filters in Microsoft SQL Server 2026 CIDR 4.1945683e-05
10,018 GenJoin: Conditional Generative Plan-to-Plan Query Optimizer that Learns from Subplan Hints 2026 SIGMOD 4.1945683e-05
10,149 CorrBound: Cardinality Estimation Accounting for Inter- and Intra-relation Correlations 2026 SIGMOD 4.1945683e-05
10,241 Robust Predicate Transfer with Dynamic Execution 2026 VLDB 4.1945683e-05
10,271 OBELISK: Efficient Offline Query Planning with Bayesian Optimization-Informed Language Model Reasoning 2026 VLDB 4.1945683e-05
10,445 LpBound in Action: Cardinality Estimation with One-Sided Guarantees 2025 SIGMOD 4.1945683e-05
10,619 Data-Agnostic Cardinality Learning from Imperfect Workloads 2025 VLDB 4.1945683e-05
10,726 Improving DBMS Scheduling Decisions with Accurate Performance Prediction on Concurrent Queries 2025 VLDB 4.1945683e-05
10,772 veDB-HTAP: a Highly Integrated, Efficient and Adaptive HTAP System 2025 VLDB 4.1945683e-05
10,868 LEAP: A Low-cost Spark SQL Query Optimizer using Pairwise Comparison 2025 VLDB 4.1945683e-05
10,983 A Universal Sketch for Estimating Heavy Hitters and Per-Element Frequency Moments in Data Streams with Bounded Deletions 2024 SIGMOD 4.1945683e-05
11,084 Presto’s History-based Query Optimizer 2024 VLDB 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 45 of 45 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1 Access Path Selection in a Relational Database Management System 1979 SIGMOD 0.0040449103
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
92 Practical Selectivity Estimation through Adaptive Sampling 1990 SIGMOD 0.00051315959
99 On the Propagation of Errors in the Size of Join Results 1991 SIGMOD 0.00050022914
141 Selectivity Estimation Without the Attribute Value Independence Assumption 1997 VLDB 0.00041786333
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
325 The History of Histograms (abridged) 2003 VLDB 0.00027378328
327 Balancing Histogram Optimality and Practicality for Query Result Size Estimation 1995 SIGMOD 0.00027308479
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
372 Selectivity Estimation using Probabilistic Models 2001 SIGMOD 0.00025354779
512 STHoles: A Multidimensional Workload-Aware Histogram 2001 SIGMOD 0.00021380733
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
758 Deep Unsupervised Cardinality Estimation 2020 VLDB 0.0001706608
806 An End-to-End Learning-based Cost Estimator 2020 VLDB 0.00016434274
808 Universality of Serial Histograms 1993 VLDB 0.00016432772
842 Independence is Good: Dependency-Based Histogram Synopses for High-Dimensional Data 2001 SIGMOD 0.00016031973
884 Plan-Structured Deep Neural Network Models for Query Performance Prediction 2019 VLDB 0.00015654004
910 NeuroCard: One Cardinality Estimator for All Tables 2021 VLDB 0.00015423056
943 Wander Join: Online Aggregation via Random Walks 2016 SIGMOD 0.00015145883
996 Approximating Multi-Dimensional Aggregate Range Queries Over Real Attributes 2000 SIGMOD 0.00014741524
1,105 Cardinality Estimation Done Right: Index-Based Join Sampling 2017 CIDR 0.00013990395
1,254 Selectivity Estimation for Range Predicates using Lightweight Models 2019 VLDB 0.00013027411
1,369 Random Sampling over Joins Revisited 2018 SIGMOD 0.00012339777
1,442 What do Shannon-type Inequalities, Submodular Width, and Disjunctive Datalog have to do with one another? 2017 PODS 0.00011956109
1,547 Lightweight Graphical Models for Selectivity Estimation Without Independence Assumptions 2011 VLDB 0.00011442359
1,638 Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation 2022 VLDB 0.00011049779
2,083 Towards a Learning Optimizer for Shared Clouds 2019 VLDB 9.5834572e-05
2,142 Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities 2019 SIGMOD 9.4507296e-05
2,165 Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation 2015 SIGMOD 9.389622e-05
2,364 Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries 2020 SIGMOD 8.9554751e-05
2,669 A Black-Box Approach to Query Cardinality Estimation 2007 CIDR 8.3389856e-05
2,762 FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation 2021 VLDB 8.1585394e-05
2,783 Flow-Loss: Learning Cardinality Estimates That Matter 2021 VLDB 8.1293383e-05
2,841 Selectivity Estimation in Extensible Databases - A Neural Network Approach 1998 VLDB 8.0287389e-05
2,969 Estimating Join Selectivities using Bandwidth-Optimized Kernel Density Models 2017 VLDB 7.7974762e-05
3,142 Active Learning for ML Enhanced Database Systems 2020 SIGMOD 7.4815444e-05
3,499 Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation 2021 VLDB 7.0376445e-05
3,646 G-CARE: A Framework for Performance Benchmarking of Cardinality Estimation Techniques for Subgraph Matching 2020 SIGMOD 6.8853079e-05
3,924 A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation 2021 SIGMOD 6.6271553e-05
4,523 Simplicity Done Right for Join Ordering 2021 CIDR 6.1135504e-05
4,543 FACE: A Normalizing Flow based Cardinality Estimator 2022 VLDB 6.1011198e-05
5,258 One Model to Rule them All: Towards Zero-Shot Learning for Databases 2022 CIDR 5.5998705e-05
6,040 Steering Query Optimizers: A Practical Take on Big Data Workloads 2021 SIGMOD 5.2412035e-05
6,775 A Unified Transferable Model for ML-Enhanced DBMS 2022 CIDR 4.9299192e-05
