Database Paper Browser


Dynamic Sample Selection for Approximate Query Processing

Summary: The paper proposes dynamic, per-query selection of biased samples for approximate query processing. A library of non-uniform samples is built in advance; at runtime, an index is used to select the most informative subset of those samples for each query. This yields more accurate aggregate estimates than a single static sample while keeping queries responsive. (summarized by gpt-5-nano on Feb 09 2026)
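
As a rough illustration of the idea in the summary, the sketch below pre-builds a small library of samples and picks the relevant ones per query. It is a minimal sketch under stated assumptions, not the paper's implementation: the function names, the `rate` and `small_frac` parameters, and the choice of "one uniform sample plus all rows from rare groups" as the biased samples are all hypothetical and are not taken from this page.

```python
import random
from collections import Counter, defaultdict

def build_sample_library(rows, columns, rate=0.1, small_frac=0.05, seed=0):
    """Preprocessing (assumed design): one uniform sample, plus, for each
    column, every row that belongs to one of that column's rare groups."""
    rng = random.Random(seed)
    uniform = [r for r in rows if rng.random() < rate]
    library = {"uniform": uniform, "rate": rate, "small": {}, "rare_values": {}}
    for col in columns:
        counts = Counter(r[col] for r in rows)
        rare = {v for v, c in counts.items() if c < small_frac * len(rows)}
        library["rare_values"][col] = rare
        library["small"][col] = [r for r in rows if r[col] in rare]
    return library

def approx_group_sum(library, group_col, value_col):
    """Runtime: select the samples relevant to this query and combine them."""
    rare = library["rare_values"].get(group_col, set())
    estimate = defaultdict(float)
    # Rare groups are fully stored, so each row counts with weight 1.
    for r in library["small"].get(group_col, []):
        estimate[r[group_col]] += r[value_col]
    # Common groups come from the uniform sample, scaled up by 1 / rate.
    for r in library["uniform"]:
        if r[group_col] not in rare:
            estimate[r[group_col]] += r[value_col] / library["rate"]
    return dict(estimate)

# Hypothetical data: one large group and one small group.
rows = [{"region": "A", "amount": 1}] * 1000 + [{"region": "B", "amount": 5}] * 10
library = build_sample_library(rows, columns=["region"])
print(approx_group_sum(library, "region", "amount"))
```

In this toy setup the rare group `B` is answered exactly from its stored rows, while the estimate for the common group `A` varies with the random sample. A single uniform sample at the same rate could miss `B` entirely, which is the kind of gap per-query sample selection is meant to close.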

Paper ID: 3461
Venue: SIGMOD
Year: 2003
Pagerank: 0.00012993347
Overall Rank: 1,260 | 91.24%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 44 of 44 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
1,204 | VerdictDB: Universalizing Approximate Query Processing | 2018 | SIGMOD | 0.00013319541
1,323 | Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters | 2016 | SIGMOD | 0.00012601997
1,475 | Online Maintenance of Very Large Random Samples on Flash Storage | 2008 | VLDB | 0.00011806921
1,574 | Approximate Query Processing: No Silver Bullet | 2017 | SIGMOD | 0.00011287495
2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05
2,184 | A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data | 2014 | SIGMOD | 9.3429789e-05
2,355 | G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data | 2015 | SIGMOD | 8.9677847e-05
2,365 | The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing | 2014 | SIGMOD | 8.9551432e-05
2,368 | Online Maintenance of Very Large Random Samples | 2004 | SIGMOD | 8.9501526e-05
2,580 | Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee | 2016 | SIGMOD | 8.5058814e-05
2,588 | Database Learning: Toward a Database that Becomes Smarter Every Time | 2017 | SIGMOD | 8.4909562e-05
3,594 | Continuous Sampling for Online Aggregation Over Multiple Queries | 2010 | SIGMOD | 6.9381343e-05
3,835 | I've Seen "Enough": Incrementally Improving Visualizations to Support Rapid Decision Making | 2017 | VLDB | 6.7163364e-05
3,944 | AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics | 2018 | SIGMOD | 6.6078243e-05
3,954 | Efficiently Approximating Selectivity Functions using Low Overhead Regression Models | 2020 | VLDB | 6.5926838e-05
4,014 | Exploiting Correlations for Expensive Predicate Evaluation | 2015 | SIGMOD | 6.5273084e-05
4,546 | Bounded Conjunctive Queries | 2014 | VLDB | 6.0987778e-05
4,681 | Adaptive Sampling for Rapidly Matching Histograms | 2018 | VLDB | 6.0034918e-05
5,252 | Error-bounded Sampling for Analytics on Big Sparse Data | 2014 | VLDB | 5.6024389e-05
5,539 | Supporting Time-Constrained SQL Queries in Oracle | 2007 | VLDB | 5.4503121e-05
5,806 | BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees | 2019 | SIGMOD | 5.3200643e-05
5,817 | Derby/S: A DBMS for Sample-Based Query Answering | 2006 | SIGMOD | 5.3156799e-05
5,868 | ABS: a System for Scalable Approximate Queries with Accuracy Guarantees | 2014 | SIGMOD | 5.2959352e-05
6,330 | Efficient Construction of Approximate Ad-Hoc ML models Through Materialization and Reuse | 2018 | VLDB | 5.1077416e-05
6,400 | iOLAP: Managing Uncertainty for Efficient Incremental OLAP | 2016 | SIGMOD | 5.0803518e-05
6,493 | Joins on Samples: A Theoretical Guide for Practitioners | 2020 | VLDB | 5.0424713e-05
6,740 | Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing | 2021 | SIGMOD | 4.944395e-05
6,822 | Skimmer: Rapid Scrolling of Relational Query Results | 2012 | SIGMOD | 4.9152454e-05
7,085 | Querying Big Data by Accessing Small Data | 2015 | PODS | 4.8388174e-05
7,251 | Learning to Sample: Counting with Complex Queries | 2020 | VLDB | 4.7890519e-05
7,305 | Tempura: A General Cost-Based Optimizer Framework for Incremental Data Processing | 2021 | VLDB | 4.7678776e-05
7,784 | Authenticated Online Data Integration Services | 2015 | SIGMOD | 4.6517065e-05
7,872 | Probabilistic Database Summarization for Interactive Data Exploration | 2017 | VLDB | 4.6307184e-05
8,240 | Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters | 2019 | VLDB | 4.5522563e-05
8,643 | One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees | 2022 | SIGMOD | 4.4777916e-05
8,684 | Unbiased Estimation of Size and Other Aggregates Over Hidden Web Databases | 2010 | SIGMOD | 4.4677591e-05
8,715 | Data Driven Approximation with Bounded Resources | 2017 | VLDB | 4.4619052e-05
9,431 | PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression | 2024 | VLDB | 4.3434046e-05
10,497 | PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees | 2025 | SIGMOD | 4.1945683e-05
10,881 | Datamap-Driven Tabular Coreset Selection for Classifier Training | 2025 | VLDB | 4.1945683e-05
10,981 | Enabling Adaptive Sampling for Intra-Window Join: Simultaneously Optimizing Quantity and Quality | 2024 | SIGMOD | 4.1945683e-05
11,194 | A Step Toward Deep Online Aggregation | 2023 | SIGMOD | 4.1945683e-05
11,429 | Leam: An Interactive System for In-situ Visual Text Analysis | 2021 | CIDR | 4.1945683e-05
11,539 | FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data | 2021 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 17 of 17 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.


Semantically Similar Papers