Database Paper Browser

Pre-training Summarization Models of Structured Datasets for Cardinality Estimation

Summary: Pre-trained models convert structured datasets into compact summaries for cardinality estimation, eliminating per-dataset training and speeding up summary construction by up to 100x. The approach uses multiple summaries per dataset and reserves learned summaries for hard columnsets. It relies on frequency and correlation patterns that are shared across datasets and learned from a diverse corpus, and it supports incremental updates. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 12918
Venue: VLDB
Year: 2022
Pagerank: 5.0937722e-05
Overall Rank: 6,368 | 55.70%
DOI: 10.14778/3494124.3494127

Incoming Citations (Sorted by Pagerank)

Showing 10 of 10 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 21 of 21 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975
102 | The Case for Learned Index Structures | 2018 | SIGMOD | 0.00049545203
166 | Approximate Frequency Counts over Data Streams | 2002 | VLDB | 0.00039361552
204 | Learned Cardinalities: Estimating Correlated Joins with Deep Learning | 2019 | CIDR | 0.00034784455
222 | Wavelet-Based Histograms for Selectivity Estimation | 1998 | SIGMOD | 0.00032828302
224 | CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies | 2004 | SIGMOD | 0.00032746205
325 | The History of Histograms (abridged) | 2003 | VLDB | 0.00027378328
333 | Neo: A Learned Query Optimizer | 2019 | VLDB | 0.00027206884
372 | Selectivity Estimation using Probabilistic Models | 2001 | SIGMOD | 0.00025354779
378 | Towards Estimation Error Guarantees for Distinct Values | 2000 | PODS | 0.0002497492
405 | Approximate Query Processing Using Wavelets | 2000 | VLDB | 0.00024057494
512 | STHoles: A Multidimensional Workload-Aware Histogram | 2001 | SIGMOD | 0.00021380733
608 | DeepDB: Learn from Data, not from Queries! | 2020 | VLDB | 0.00019235898
727 | On Synopses for Distinct-Value Estimation Under Multiset Operations | 2007 | SIGMOD | 0.00017508726
758 | Deep Unsupervised Cardinality Estimation | 2020 | VLDB | 0.0001706608
842 | Independence is Good: Dependency-Based Histogram Synopses for High-Dimensional Data | 2001 | SIGMOD | 0.00016031973
910 | NeuroCard: One Cardinality Estimator for All Tables | 2021 | VLDB | 0.00015423056
1,254 | Selectivity Estimation for Range Predicates using Lightweight Models | 2019 | VLDB | 0.00013027411
1,400 | Wavelet Synopses with Error Guarantees | 2002 | SIGMOD | 0.00012191684
1,547 | Lightweight Graphical Models for Selectivity Estimation Without Independence Assumptions | 2011 | VLDB | 0.00011442359
2,165 | Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation | 2015 | SIGMOD | 9.389622e-05

Semantically Similar Papers