Consistent and Flexible Selectivity Estimation for High-Dimensional Data

Summary: Deep-learning-based selectivity estimation learns a query-dependent piecewise-linear function whose output is guaranteed to be non-decreasing in the threshold. To scale to high-dimensional data, the method partitions the dataset into disjoint subsets and trains local models, achieving superior accuracy and efficiency over state-of-the-art approaches on real data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6054
Venue: SIGMOD
Year: 2021
Pagerank: 4.5261239e-05
Overall Rank: 8,383 | 41.74%
DOI: 10.1145/3448016.3452772

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 7 of 7 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
2,988	Neural Subgraph Counting with Wasserstein Estimator	2022	SIGMOD	7.7752463e-05
4,273	Similarity Query Processing for High-Dimensional Data	2020	VLDB	6.2932217e-05
9,487	Spatial Query Optimization With Learning	2024	VLDB	4.3300131e-05
9,621	ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation	2023	VLDB	4.3125802e-05
10,219	Practical Parameterized Query Optimization via Efficient Plan Reuse and List-wise Ranking	2026	SIGMOD	4.1905499e-05
10,837	Cardinality Estimation for Similarity Search on High-Dimensional Data Objects: The Impact of Reference Objects	2025	VLDB	4.1905499e-05
11,192	Efficient and Effective Cardinality Estimation for Skyline Family	2023	SIGMOD	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 15 of 15 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
101	The Case for Learned Index Structures	2018	SIGMOD	0.00049778866
159	LOF: Identifying Density-Based Local Outliers	2000	SIGMOD	0.00040135453
203	Learned Cardinalities: Estimating Correlated Joins with Deep Learning	2019	CIDR	0.00034868567
219	Deep Entity Matching with Pre-Trained Language Models	2021	VLDB	0.00033354456
325	The History of Histograms (abridged)	2003	VLDB	0.00027398081
752	Deep Unsupervised Cardinality Estimation	2020	VLDB	0.00017138049
804	An End-to-End Learning-based Cost Estimator	2020	VLDB	0.0001643674
1,727	QuickSel: Quick Selectivity Learning with Mixture Models	2020	SIGMOD	0.00010731889
1,756	Sampling-Based Query Re-Optimization	2016	SIGMOD	0.00010659753
2,108	LISA: A Learned Index Structure for Spatial Data	2020	SIGMOD	9.5283642e-05
2,167	Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation	2015	SIGMOD	9.3879598e-05
2,176	Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services	2017	SIGMOD	9.3729351e-05
2,364	Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries	2020	SIGMOD	8.955077e-05
5,630	Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach	2020	SIGMOD	5.4010111e-05
7,246	Learning to Sample: Counting with Complex Queries	2020	VLDB	4.7847433e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
1,240	Multi-dimensional Selectivity Estimation Using Compressed Histogram Information	1999	SIGMOD	0.00013090678
7,473	Cardinality Estimation of Approximate Substring Queries using Deep Learning	2022	VLDB	4.7149077e-05
9,690	Selectivity Estimation for Queries Containing Predicates over Set-Valued Attributes	2023	SIGMOD	4.2994116e-05
3,955	Efficiently Approximating Selectivity Functions using Low Overhead Regression Models	2020	VLDB	6.5895015e-05
2,167	Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation	2015	SIGMOD	9.3879598e-05
7,442	Selectivity Functions of Range Queries are Learnable*	2022	SIGMOD	4.7248554e-05
5,630	Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach	2020	SIGMOD	5.4010111e-05
1,239	Selectivity Estimation for Range Predicates using Lightweight Models	2019	VLDB	0.00013091459
9,117	Deep Query Optimization	2019	SIGMOD	4.3885415e-05
2,364	Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries	2020	SIGMOD	8.955077e-05