Database Paper Browser

Back to papers

VerdictDB: Universalizing Approximate Query Processing

Summary: VerdictDB: universal, database-agnostic AQP via driver-level middleware that rewrites queries without backend changes. Provides approximate answers with error estimates across engines (Impala, Spark SQL, Redshift), delivering 171x speedups and <3% error. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5551
Venue
SIGMOD
Year
2018
Pagerank
0.00013319541
Overall Rank
1,204 | 91.63%
DOI
10.1145/3183713.3196905

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 48 of 48 citing papers.

Rank Citing Paper Year Venue Pagerank
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
1,427 Towards Scalable Dataframe Systems 2020 VLDB 0.0001204248
1,703 Are We Ready For Learned Cardinality Estimation? 2021 VLDB 0.00010836769
2,359 Data Market Platforms: Trading Data Assets to Solve Data Problems 2020 VLDB 8.9607667e-05
2,501 DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models 2019 SIGMOD 8.6453446e-05
3,565 Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation 2025 SIGMOD 6.9655362e-05
3,968 QUAD: Quadratic-Bound-based Kernel Density Visualization 2020 SIGMOD 6.5793715e-05
5,806 BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees 2019 SIGMOD 5.3200643e-05
5,810 Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data 2020 SIGMOD 5.3178017e-05
5,951 PGMJoins: Random Join Sampling with Graphical Models 2021 SIGMOD 5.2592385e-05
6,230 Learned Approximate Query Processing: Make it Light, Accurate and Fast 2021 CIDR 5.145989e-05
6,493 Joins on Samples: A Theoretical Guide for Practitioners 2020 VLDB 5.0424713e-05
6,740 Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing 2021 SIGMOD 4.944395e-05
7,073 Marviq: Quality-Aware Geospatial Visualization of Range-Selection Queries Using Materialization 2020 SIGMOD 4.842703e-05
7,251 Learning to Sample: Counting with Complex Queries 2020 VLDB 4.7890519e-05
7,339 SpareLLM: Automatically Selecting Task-Specific Minimum-Cost Large Language Models under Equivalence Constraint 2025 SIGMOD 4.7579469e-05
7,534 Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams 2022 VLDB 4.7180004e-05
8,080 Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines 2024 VLDB 4.5911668e-05
8,240 Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters 2019 VLDB 4.5522563e-05
8,393 LAQy: Efficient and Reusable Query Approximations via Lazy Sampling 2023 SIGMOD 4.5280102e-05
8,622 ProgressiveDB – Progressive Data Analytics as a Middleware 2019 VLDB 4.4834877e-05
8,643 One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees 2022 SIGMOD 4.4777916e-05
9,049 JENNER: Just-in-time Enrichment in Query Processing 2022 VLDB 4.4039656e-05
9,107 NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks 2023 SIGMOD 4.3950706e-05
9,431 PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression 2024 VLDB 4.3434046e-05
9,621 ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation 2023 VLDB 4.3167167e-05
9,757 Efficient Insights Discovery through Conditional Generative Model based Query Approximation 2022 SIGMOD 4.2893233e-05
9,758 Practical Dynamic Extension for Sampling Indexes 2023 SIGMOD 4.2879116e-05
9,949 AB-tree: Index for Concurrent Random Sampling and Updates 2022 VLDB 4.2421586e-05
10,223 On Fair Epsilon Net and Geometric Hitting Set 2026 VLDB 4.1945683e-05
10,254 Secure Multi-Party Sampling over Joins 2026 VLDB 4.1945683e-05
10,279 ConANN: Conformal Approximate Nearest Neighbor Search 2026 VLDB 4.1945683e-05
10,337 Efficient Approximate Query Processing with Block Sampling 2025 CIDR 4.1945683e-05
10,434 Demo of Kishu: Time-Traveling for Computational Notebooks 2025 SIGMOD 4.1945683e-05
10,481 FAAQP: Fast and Accurate Approximate Query Processing based on Bitmap-augmented Sum-Product Network 2025 SIGMOD 4.1945683e-05
10,497 PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees 2025 SIGMOD 4.1945683e-05
10,565 Holistic query Approximation via RL Modeling 2025 VLDB 4.1945683e-05
10,608 Approximation-First Timeseries Query At Scale 2025 VLDB 4.1945683e-05
10,639 Cardinality Estimation for Having-Clauses 2025 VLDB 4.1945683e-05
10,881 Datamap-Driven Tabular Coreset Selection for Classifier Training 2025 VLDB 4.1945683e-05
11,194 A Step Toward Deep Online Aggregation 2023 SIGMOD 4.1945683e-05
11,217 Efficient Approximation Framework for Attribute Recommendation 2023 SIGMOD 4.1945683e-05
11,285 Approximate Queries over Concurrent Updates 2023 VLDB 4.1945683e-05
11,427 Accelerating Complex Analytics using Speculation 2021 CIDR 4.1945683e-05
11,453 XLJoins 2021 SIGMOD 4.1945683e-05
11,514 ATLANTIC: Making Database Differentially Private and Faster with Accuracy Guarantee 2021 VLDB 4.1945683e-05
11,539 FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data 2021 VLDB 4.1945683e-05
11,711 Demonstration of VerdictDB, the Platform-Independent AQP System 2018 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 39 of 39 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
14 Online Aggregation 1997 SIGMOD 0.0010801504
18 On Random Sampling over Joins 1999 SIGMOD 0.00092385438
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
211 Join Synopses for Approximate Query Answering 1999 SIGMOD 0.00033981214
217 Ripple Joins for Online Aggregation 1999 SIGMOD 0.00033536712
429 The Aqua Approximate Query Answering System 1999 SIGMOD 0.00023476494
848 Approximate Counts and Quantiles over Sliding Windows 2004 PODS 0.0001597308
943 Wander Join: Online Aggregation via Random Walks 2016 SIGMOD 0.00015145883
1,152 Blink and It's Done: Interactive Queries on Very Large Data 2012 VLDB 0.00013645792
1,260 Dynamic Sample Selection for Approximate Query Processing 2003 SIGMOD 0.00012993347
1,323 Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters 2016 SIGMOD 0.00012601997
1,335 ICICLES: Self-tuning Samples for Approximate Query Answering 2000 VLDB 0.00012502131
1,464 Online Aggregation for Large MapReduce Jobs 2011 VLDB 0.00011865546
1,874 Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems 2014 SIGMOD 0.00010244443
1,909 SciBORQ: Scientific data management with Bounds On Runtime and Quality 2011 CIDR 0.00010121304
2,230 Performance and Resource Modeling in Highly-Concurrent OLTP Workloads 2013 SIGMOD 9.2322426e-05
2,355 G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data 2015 SIGMOD 8.9677847e-05
2,365 The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing 2014 SIGMOD 8.9551432e-05
2,588 Database Learning: Toward a Database that Becomes Smarter Every Time 2017 SIGMOD 8.4909562e-05
2,616 DAQ: A New Paradigm for Approximate Query Processing 2015 VLDB 8.4471955e-05
2,779 Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries 2008 VLDB 8.1320575e-05
2,995 A Sampling Algebra for Aggregate Estimation 2013 VLDB 7.7587199e-05
3,118 Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning 2015 VLDB 7.5379338e-05
3,167 Relational Confidence Bounds Are Easy With The Bootstrap* 2005 SIGMOD 7.4523397e-05
3,279 Early Accurate Results for Advanced Analytics on MapReduce 2012 VLDB 7.2855494e-05
3,333 SnappyData: A Unified Cluster for Streaming, Transactions, and Interactive Analytics 2017 CIDR 7.2093479e-05
3,594 Continuous Sampling for Online Aggregation Over Multiple Queries 2010 SIGMOD 6.9381343e-05
4,013 DBSeer: Resource and Performance Prediction for Building a Next Generation Database Cloud 2013 CIDR 6.529956e-05
4,030 Revisiting Reuse for Approximate Query Processing 2017 VLDB 6.5129665e-05
4,052 Interactive Analysis of Web-Scale Data 2009 CIDR 6.4936745e-05
5,224 Neighbor-Sensitive Hashing 2016 VLDB 5.6197981e-05
5,262 SnappyData: A Hybrid Transactional Analytical Store Built On Spark 2016 SIGMOD 5.5977349e-05
5,581 CliffGuard: A Principled Framework for Finding Robust Database Designs 2015 SIGMOD 5.424205e-05
5,868 ABS: a System for Scalable Approximate Queries with Accuracy Guarantees 2014 SIGMOD 5.2959352e-05
6,169 Approximate Lifted Inference with Probabilistic Databases 2015 VLDB 5.1716068e-05
6,411 Approximate Query Engines: Commercial Challenges and Research Opportunities 2017 SIGMOD 5.0752468e-05
7,085 Querying Big Data by Accessing Small Data 2015 PODS 4.8388174e-05
11,711 Demonstration of VerdictDB, the Platform-Independent AQP System 2018 SIGMOD 4.1945683e-05
13,354 Verdict: A System for Stochastic Query Planning 2015 CIDR -
Previous Page 1 / 1 Next

Semantically Similar Papers