Database Paper Browser

Approximate Query Processing: Taming the TeraBytes! A Tutorial

Summary: A tutorial surveying approximate query processing over terabyte-scale data. It contrasts online aggregation with precomputed synopses as two ways to return fast answers with bounded error, and covers multi-dimensional and join synopses, set-valued queries, AQUA-style query rewriting, synopsis maintenance, and streaming data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 8824
Venue: VLDB
Year: 2001
Pagerank: 0.00022846068
Overall Rank: 449 | 96.88%
DOI: -

[Chart: Incoming Non-self Citations Over Time]

Authors

Incoming Citations (Sorted by Pagerank)

Showing 41 of 41 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
149 | Trio: A System for Integrated Management of Data, Accuracy, and Lineage | 2005 | CIDR | 0.00041101118
475 | Mining Database Structure; Or, How to Build a Data Quality Browser | 2002 | SIGMOD | 0.00022303253
477 | Model-Driven Data Acquisition in Sensor Networks | 2004 | VLDB | 0.00022221803
696 | BlazeIt: Optimizing Declarative Aggregation and Limit Queries for Neural Network-Based Video Analytics | 2020 | VLDB | 0.00018048935
905 | The Design of an Acquisitional Query Processor For Sensor Networks | 2003 | SIGMOD | 0.0001546195
1,064 | Processing Complex Aggregate Queries over Data Streams | 2002 | SIGMOD | 0.00014356481
1,152 | Blink and It's Done: Interactive Queries on Very Large Data | 2012 | VLDB | 0.00013645792
1,420 | Data Management Challenges in Production Machine Learning | 2017 | SIGMOD | 0.00012057956
1,574 | Approximate Query Processing: No Silver Bullet | 2017 | SIGMOD | 0.00011287495
2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05
2,118 | Using Probabilistic Models for Data Management in Acquisitional Environments | 2005 | CIDR | 9.5100739e-05
2,184 | A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data | 2014 | SIGMOD | 9.3429789e-05
2,492 | Partial Results for Online Query Processing | 2002 | SIGMOD | 8.6526489e-05
2,580 | Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee | 2016 | SIGMOD | 8.5058814e-05
2,863 | Incremental and Approximate Inference for Faster Occlusion-based Deep CNN Explanations | 2019 | SIGMOD | 7.9877991e-05
3,393 | Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows | 2022 | VLDB | 7.1483239e-05
3,565 | Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation | 2025 | SIGMOD | 6.9655362e-05
3,835 | I've Seen "Enough": Incrementally Improving Visualizations to Support Rapid Decision Making | 2017 | VLDB | 6.7163364e-05
4,014 | Exploiting Correlations for Expensive Predicate Evaluation | 2015 | SIGMOD | 6.5273084e-05
4,668 | PrivateClean: Data Cleaning and Differential Privacy | 2016 | SIGMOD | 6.0115918e-05
4,716 | Mining Graph Patterns Efficiently via Randomized Summaries | 2009 | VLDB | 5.9755569e-05
4,909 | A Method for Optimizing Opaque Filter Queries | 2020 | SIGMOD | 5.8338804e-05
5,140 | A Random Walk Approach to Sampling Hidden Databases | 2007 | SIGMOD | 5.668209e-05
6,330 | Efficient Construction of Approximate Ad-Hoc ML models Through Materialization and Reuse | 2018 | VLDB | 5.1077416e-05
6,838 | Capturing Data Uncertainty in High-Volume Stream Processing | 2009 | CIDR | 4.9109732e-05
7,477 | Benchmarking Spreadsheet Systems | 2020 | SIGMOD | 4.7188671e-05
7,890 | Mining a Search Engine’s Corpus: Efficient Yet Unbiased Sampling and Aggregate Estimation | 2011 | SIGMOD | 4.6249533e-05
8,728 | Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views | 2015 | VLDB | 4.4589711e-05
8,851 | Efficient Approximations of Conjunctive Queries | 2012 | PODS | 4.4363908e-05
9,432 | Aggregate Estimation Over Dynamic Hidden Web Databases | 2014 | VLDB | 4.3431757e-05
9,614 | Auto-Approximation of Graph Computing | 2014 | VLDB | 4.3177432e-05
9,992 | Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First | 2026 | CIDR | 4.1945683e-05
10,049 | Approximate Query Processing under Updates | 2026 | SIGMOD | 4.1945683e-05
10,116 | Stochastic Submodular Data Forgetting | 2026 | SIGMOD | 4.1945683e-05
10,215 | Task Cascades for Efficient Unstructured Data Processing | 2026 | SIGMOD | 4.1945683e-05
10,886 | FaDE: More Than a Million What-ifs Per Second | 2025 | VLDB | 4.1945683e-05
11,502 | In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All | 2021 | VLDB | 4.1945683e-05
11,691 | Enabling Data Science for the Majority | 2019 | VLDB | 4.1945683e-05
11,832 | A Study of Sorting Algorithms on Approximate Memory | 2016 | SIGMOD | 4.1945683e-05
12,019 | When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities | 2014 | VLDB | 4.1945683e-05
12,506 | AQAX: A System for Approximate XML Query Answers | 2006 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 2 of 52 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
5,082 | A Comparison of Selectivity Estimators for Range Queries on Metric Attributes | 1999 | SIGMOD | 5.711623e-05
5,982 | Modeling skewed distributions using multifractals and the '80-20 law' | 1996 | VLDB | 5.2446136e-05

Semantically Similar Papers