Database Paper Browser

Back to papers

Approximate Query Processing: No Silver Bullet

Summary: State of the art of Approximate Query Processing; progress notable but limited in product impact. Proposes two concrete avenues to integrate AQP into data platforms (architecture, tooling) to realize practical value. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5345
Venue
SIGMOD
Year
2017
Pagerank
0.00011287495
Overall Rank
1,574 | 89.06%
DOI
10.1145/3055918.3056097

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 42 of 42 citing papers.

Rank Citing Paper Year Venue Pagerank
1,350 Northstar: An Interactive Data Science System 2018 VLDB 0.00012431059
2,501 DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models 2019 SIGMOD 8.6453446e-05
3,798 Plato: Approximate Analytics over Compressed Time Series with Tight Deterministic Error Guarantees 2020 VLDB 6.7592302e-05
3,944 AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics 2018 SIGMOD 6.6078243e-05
4,375 Sample Debiasing in the Themis Open World Database System 2020 SIGMOD 6.2427076e-05
4,536 Data Series Progressive Similarity Search with Probabilistic Quality Guarantees 2020 SIGMOD 6.104642e-05
4,567 Optimizing Video Analytics with Declarative Model Relationships 2023 VLDB 6.080526e-05
4,884 Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration 2020 VLDB 5.8540287e-05
5,072 Optimizing Machine Learning Inference Queries with Correlative Proxy Models 2022 VLDB 5.7185674e-05
5,214 ThalamusDB: Approximate Query Processing on Multi-Modal Data 2024 SIGMOD 5.624434e-05
5,810 Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data 2020 SIGMOD 5.3178017e-05
5,909 At-the-time and Back-in-time Persistent Sketches 2021 SIGMOD 5.2769377e-05
6,233 Mosaic: A Sample-Based Database System for Open World Query Processing 2020 CIDR 5.1451876e-05
6,740 Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing 2021 SIGMOD 4.944395e-05
7,059 Adaptive and Robust Query Execution for Lakehouses at Scale 2024 VLDB 4.8477825e-05
7,164 SKT: A One-Pass Multi-Sketch Data Analytics Accelerator 2021 VLDB 4.8131514e-05
7,339 SpareLLM: Automatically Selecting Task-Specific Minimum-Cost Large Language Models under Equivalence Constraint 2025 SIGMOD 4.7579469e-05
7,358 Weighted Distinct Sampling: Cardinality Estimation for SPJ Queries 2021 SIGMOD 4.7529363e-05
7,534 Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams 2022 VLDB 4.7180004e-05
7,610 Learning to be a Statistician: Learned Estimator for Number of Distinct Values 2022 VLDB 4.6965039e-05
7,634 ReStore - Neural Data Completion for Relational Databases 2021 SIGMOD 4.6911382e-05
8,240 Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters 2019 VLDB 4.5522563e-05
8,393 LAQy: Efficient and Reusable Query Approximations via Lazy Sampling 2023 SIGMOD 4.5280102e-05
8,650 HAP: An Efficient Hamming Space Index Based on Augmented Pigeonhole Principle 2022 SIGMOD 4.4761716e-05
9,006 Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems 2024 VLDB 4.4101482e-05
9,296 Controlled Intentional Degradation in Analytical Video Systems 2022 SIGMOD 4.3599613e-05
9,431 PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression 2024 VLDB 4.3434046e-05
9,652 Secure Sampling for Approximate Multi-party Query Processing 2023 SIGMOD 4.3109001e-05
9,696 The Data Interaction Game 2018 SIGMOD 4.3023337e-05
9,786 RALF: Accuracy-Aware Scheduling for Feature Store Maintenance 2024 VLDB 4.2827012e-05
9,807 Demonstration of Accelerating Machine Learning Inference Queries with Correlative Proxy Models 2022 VLDB 4.2805224e-05
10,049 Approximate Query Processing under Updates 2026 SIGMOD 4.1945683e-05
10,116 Stochastic Submodular Data Forgetting 2026 SIGMOD 4.1945683e-05
10,215 Task Cascades for Efficient Unstructured Data Processing 2026 SIGMOD 4.1945683e-05
10,279 ConANN: Conformal Approximate Nearest Neighbor Search 2026 VLDB 4.1945683e-05
10,497 PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees 2025 SIGMOD 4.1945683e-05
10,639 Cardinality Estimation for Having-Clauses 2025 VLDB 4.1945683e-05
11,074 Confidence Intervals for Private Query Processing 2024 VLDB 4.1945683e-05
11,194 A Step Toward Deep Online Aggregation 2023 SIGMOD 4.1945683e-05
11,539 FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data 2021 VLDB 4.1945683e-05
11,552 BitGourmet: Deterministic Approximation via Optimized Bit Selection 2020 CIDR 4.1945683e-05
11,585 Demonstration of BitGourmet: Data Analysis via Deterministic Approximation 2020 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 50 of 51 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
3 Pig Latin: A Not-So-Foreign Language for Data Processing 2008 SIGMOD 0.0024183614
11 Implementing Data Cubes Efficiently 1996 SIGMOD 0.0011708144
14 Online Aggregation 1997 SIGMOD 0.0010801504
18 On Random Sampling over Joins 1999 SIGMOD 0.00092385438
22 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets 2008 VLDB 0.0008456613
28 Accurate Estimation Of The Number Of Tuples Satisfying A Condition 1984 SIGMOD 0.00080435857
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
70 Hive - A Warehousing Solution Over a Map-Reduce Framework 2009 VLDB 0.00059533166
109 Dremel: Interactive Analysis of Web-Scale Datasets 2010 VLDB 0.00048186983
158 Automated Selection of Materialized Views and Indexes for SQL Databases 2000 VLDB 0.00040071492
217 Ripple Joins for Online Aggregation 1999 SIGMOD 0.00033536712
273 Approximate Computation of Multidimensional Aggregates of Sparse Data Using Wavelets 1999 SIGMOD 0.00029390945
378 Towards Estimation Error Guarantees for Distinct Values 2000 PODS 0.0002497492
405 Approximate Query Processing Using Wavelets 2000 VLDB 0.00024057494
429 The Aqua Approximate Query Answering System 1999 SIGMOD 0.00023476494
449 Approximate Query Processing: Taming the TeraBytes! A Tutorial 2001 VLDB 0.00022846068
530 Random Sampling for Histogram Construction: How much is enough? 1998 SIGMOD 0.00020803682
739 Congressional Samples for Approximate Answering of Group-By Queries 2000 SIGMOD 0.00017401518
943 Wander Join: Online Aggregation via Random Walks 2016 SIGMOD 0.00015145883
1,098 Trill: A High-Performance Incremental Query Processor for Diverse Analytics 2015 VLDB 0.00014114442
1,152 Blink and It's Done: Interactive Queries on Very Large Data 2012 VLDB 0.00013645792
1,193 Join Size Estimation Subject to Filter Conditions 2015 VLDB 0.00013414989
1,260 Dynamic Sample Selection for Approximate Query Processing 2003 SIGMOD 0.00012993347
1,323 Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters 2016 SIGMOD 0.00012601997
1,335 ICICLES: Self-tuning Samples for Approximate Query Answering 2000 VLDB 0.00012502131
1,425 Scalable Approximate Query Processing With The DBO Engine 2007 SIGMOD 0.00012051353
1,464 Online Aggregation for Large MapReduce Jobs 2011 VLDB 0.00011865546
1,874 Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems 2014 SIGMOD 0.00010244443
1,909 SciBORQ: Scientific data management with Bounds On Runtime and Quality 2011 CIDR 0.00010121304
2,011 Rapid Sampling for Visualizations with Ordering Guarantees 2015 VLDB 9.7964875e-05
2,355 G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data 2015 SIGMOD 8.9677847e-05
2,365 The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing 2014 SIGMOD 8.9551432e-05
2,368 Online Maintenance of Very Large Random Samples 2004 SIGMOD 8.9501526e-05
2,580 Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee 2016 SIGMOD 8.5058814e-05
2,616 DAQ: A New Paradigm for Approximate Query Processing 2015 VLDB 8.4471955e-05
2,808 A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries 2001 SIGMOD 8.0870741e-05
2,995 A Sampling Algebra for Aggregate Estimation 2013 VLDB 7.7587199e-05
3,013 Cardinality Estimation Using Sample Views with Quality Assurance 2007 SIGMOD 7.7137441e-05
3,310 Optimal and Approximate Computation of Summary Statistics for Range Aggregates 2001 PODS 7.2408955e-05
4,052 Interactive Analysis of Web-Scale Data 2009 CIDR 6.4936745e-05
4,211 Querying Big Graphs within Bounded Resources 2014 SIGMOD 6.3563454e-05
4,546 Bounded Conjunctive Queries 2014 VLDB 6.0987778e-05
5,262 SnappyData: A Hybrid Transactional Analytical Store Built On Spark 2016 SIGMOD 5.5977349e-05
5,581 CliffGuard: A Principled Framework for Finding Robust Database Designs 2015 SIGMOD 5.424205e-05
5,879 Fast and Near–Optimal Algorithms for Approximating Distributions by Histograms 2015 PODS 5.2908101e-05
6,136 Scalable Progressive Analytics on Big Data in the Cloud 2013 VLDB 5.1928748e-05
7,085 Querying Big Data by Accessing Small Data 2015 PODS 4.8388174e-05
7,413 On Scale Independence for Querying Big Data 2014 PODS 4.7358047e-05
8,715 Data Driven Approximation with Bounded Resources 2017 VLDB 4.4619052e-05
8,728 Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views 2015 VLDB 4.4589711e-05
Previous Page 1 / 2 Next

Semantically Similar Papers