Database Paper Browser

Back to papers

Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems

Summary: Sampling-based AQP for large-scale analytics; error bars often fail on real workloads. Fast diagnostics of error-estimation failures and a pipeline that yields approximate answers with reliable bootstrap-based error bars at interactive speeds. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4797
Venue
SIGMOD
Year
2014
Pagerank
0.00010244443
Overall Rank
1,874 | 86.97%
DOI
10.1145/2588555.2593667

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 39 of 39 citing papers.

Rank Citing Paper Year Venue Pagerank
696 BlazeIt: Optimizing Declarative Aggregation and Limit Queries for Neural Network-Based Video Analytics 2020 VLDB 0.00018048935
943 Wander Join: Online Aggregation via Random Walks 2016 SIGMOD 0.00015145883
1,204 VerdictDB: Universalizing Approximate Query Processing 2018 SIGMOD 0.00013319541
1,323 Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters 2016 SIGMOD 0.00012601997
1,552 Overview of Data Exploration Techniques 2015 SIGMOD 0.00011408814
1,574 Approximate Query Processing: No Silver Bullet 2017 SIGMOD 0.00011287495
2,355 G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data 2015 SIGMOD 8.9677847e-05
2,365 The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing 2014 SIGMOD 8.9551432e-05
2,580 Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee 2016 SIGMOD 8.5058814e-05
2,588 Database Learning: Toward a Database that Becomes Smarter Every Time 2017 SIGMOD 8.4909562e-05
2,616 DAQ: A New Paradigm for Approximate Query Processing 2015 VLDB 8.4471955e-05
3,333 SnappyData: A Unified Cluster for Streaming, Transactions, and Interactive Analytics 2017 CIDR 7.2093479e-05
3,558 Approximate Selection with Guarantees using Proxies 2020 VLDB 6.9765724e-05
3,912 Two Birds, One Stone: A Fast, yet Lightweight, Indexing Scheme for Modern Database Systems 2017 VLDB 6.6354964e-05
3,944 AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics 2018 SIGMOD 6.6078243e-05
4,434 Lightweight and Accurate Cardinality Estimation by Neural Network Gaussian Process 2022 SIGMOD 6.1929999e-05
5,224 Neighbor-Sensitive Hashing 2016 VLDB 5.6197981e-05
5,262 SnappyData: A Hybrid Transactional Analytical Store Built On Spark 2016 SIGMOD 5.5977349e-05
5,581 CliffGuard: A Principled Framework for Finding Robust Database Designs 2015 SIGMOD 5.424205e-05
5,806 BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees 2019 SIGMOD 5.3200643e-05
5,909 At-the-time and Back-in-time Persistent Sketches 2021 SIGMOD 5.2769377e-05
6,400 iOLAP: Managing Uncertainty for Efficient Incremental OLAP 2016 SIGMOD 5.0803518e-05
6,411 Approximate Query Engines: Commercial Challenges and Research Opportunities 2017 SIGMOD 5.0752468e-05
6,493 Joins on Samples: A Theoretical Guide for Practitioners 2020 VLDB 5.0424713e-05
7,085 Querying Big Data by Accessing Small Data 2015 PODS 4.8388174e-05
7,358 Weighted Distinct Sampling: Cardinality Estimation for SPJ Queries 2021 SIGMOD 4.7529363e-05
7,747 TSCache: An Efficient Flash-based Caching Scheme for Time-series Data Workloads 2021 VLDB 4.6616405e-05
8,080 Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines 2024 VLDB 4.5911668e-05
8,138 Fast and Reliable Missing Data Contingency Analysis with Predicate-Constraints 2020 SIGMOD 4.5771031e-05
8,337 THEMIS: Fairness in Federated Stream Processing under Overload 2016 SIGMOD 4.5434623e-05
8,643 One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees 2022 SIGMOD 4.4777916e-05
8,689 Wander Join: Online Aggregation for Joins 2016 SIGMOD 4.4667389e-05
8,728 Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views 2015 VLDB 4.4589711e-05
9,296 Controlled Intentional Degradation in Analytical Video Systems 2022 SIGMOD 4.3599613e-05
9,382 Hephaestus: Data Reuse for Accelerating Scientific Discovery 2015 CIDR 4.3457368e-05
9,431 PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression 2024 VLDB 4.3434046e-05
10,279 ConANN: Conformal Approximate Nearest Neighbor Search 2026 VLDB 4.1945683e-05
11,711 Demonstration of VerdictDB, the Platform-Independent AQP System 2018 SIGMOD 4.1945683e-05
11,832 A Study of Sorting Algorithms on Approximate Memory 2016 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 15 of 15 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers