Database Paper Browser

Back to papers

Learning to be a Statistician: Learned Estimator for Number of Distinct Values

Summary: Proposes a supervised-learning NDV estimator, replacing heuristic sample methods with a data-driven model. Trains on synthetic data for workload-agnostic deployment as a UDF, outperforming existing estimators on nine real datasets; code available. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12759
Venue
VLDB
Year
2022
Pagerank
4.6965039e-05
Overall Rank
7,610 | 47.06%
DOI
10.14778/3489496.3489508

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers