PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees

Summary: PilotDB provides database-agnostic online AQP with a priori error guarantees using TAQA; BSAP enables fast block-level sampling. Prototype middleware on PostgreSQL, SQL Server, DuckDB; up to 126x speedups at 5% guaranteed error, with no DBMS changes. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 7255
Venue: SIGMOD
Year: 2025
Pagerank: 4.3648789e-05
Overall Rank: 9,238 | 35.80%
DOI: 10.1145/3725335

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
10,277	SemBench: A Benchmark for Semantic Query Processing Engines	2026	VLDB	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 46 of 46 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
14	Online Aggregation	1997	SIGMOD	0.0010813443
18	On Random Sampling over Joins	1999	SIGMOD	0.00092569117
28	Accurate Estimation Of The Number Of Tuples Satisfying A Condition	1984	SIGMOD	0.00080571183
46	Simple Random Sampling from Relational Databases	1986	VLDB	0.00071588702
63	Improved Histograms for Selectivity Estimation of Range Predicates	1996	SIGMOD	0.00063595699
185	DuckDB: an Embeddable Analytical Database	2019	SIGMOD	0.00036529607
212	Join Synopses for Approximate Query Answering	1999	SIGMOD	0.00033997204
216	Ripple Joins for Online Aggregation	1999	SIGMOD	0.00033560137
291	Error-Constrained COUNT Query Evaluation in Relational Databases	1991	SIGMOD	0.00028778973
431	The Aqua Approximate Query Answering System	1999	SIGMOD	0.00023397171
736	Congressional Samples for Approximate Answering of Group-By Queries	2000	SIGMOD	0.00017414831
960	Aqua: A Fast Decision Support System Using Approximate Query Answers	1999	VLDB	0.00015031055
1,161	VerdictDB: Universalizing Approximate Query Processing	2018	SIGMOD	0.00013579831
1,257	Dynamic Sample Selection for Approximate Query Processing	2003	SIGMOD	0.00013002384
1,320	Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters	2016	SIGMOD	0.00012606067
1,331	ICICLES: Self-tuning Samples for Approximate Query Answering	2000	VLDB	0.00012553948
1,372	Random Sampling over Joins Revisited	2018	SIGMOD	0.0001233325
1,425	Scalable Approximate Query Processing With The DBO Engine	2007	SIGMOD	0.00012044433
1,451	Online Aggregation for Large MapReduce Jobs	2011	VLDB	0.00011925842
1,574	Approximate Query Processing: No Silver Bullet	2017	SIGMOD	0.00011289028
2,354	G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data	2015	SIGMOD	8.9748896e-05
2,424	The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing	2014	SIGMOD	8.8415494e-05
2,494	DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models	2019	SIGMOD	8.6457436e-05
2,848	A General-Purpose Counting Filter: Making Every Bit Count	2017	SIGMOD	8.0202739e-05
2,937	DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems	2021	VLDB	7.8552033e-05
2,995	A Sampling Algebra for Aggregate Estimation	2013	VLDB	7.7606324e-05
3,066	Learning a Partitioning Advisor for Cloud Databases	2020	SIGMOD	7.6255556e-05
3,133	Relational Confidence Bounds Are Easy With The Bootstrap*	2005	SIGMOD	7.4979168e-05
3,150	The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database	2016	VLDB	7.4765759e-05
3,808	Turbo-Charging Estimate Convergence in DBO	2009	VLDB	6.7416988e-05
4,020	Revisiting Reuse for Approximate Query Processing	2017	VLDB	6.5209063e-05
4,703	Accelerating Approximate Aggregation Queries with Expensive Predicates	2021	VLDB	5.9793615e-05
4,815	DigitHist: a Histogram-Based Data Summary with Tight Error Bounds	2017	VLDB	5.8978716e-05
5,578	CliffGuard: A Principled Framework for Finding Robust Database Designs	2015	SIGMOD	5.4231783e-05
5,815	Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data	2020	SIGMOD	5.3158788e-05
5,867	ABS: a System for Scalable Approximate Queries with Accuracy Guarantees	2014	SIGMOD	5.2933639e-05
6,332	Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine	2024	SIGMOD	5.1021765e-05
6,402	Approximate Query Engines: Commercial Challenges and Research Opportunities	2017	SIGMOD	5.0725227e-05
6,481	Joins on Samples: A Theoretical Guide for Practitioners	2020	VLDB	5.039683e-05
7,911	Accelerating Aggregation Queries on Unstructured Streams of Data	2023	VLDB	4.6143141e-05
8,586	Bias-Aware Sketches	2017	VLDB	4.4855966e-05
8,606	ProgressiveDB – Progressive Data Analytics as a Middleware	2019	VLDB	4.4811623e-05
9,032	Making Data Clouds Smarter at Keebo: Automated Warehouse Optimization using Data Learning	2023	SIGMOD	4.3998185e-05
9,944	DeepOLA: Online Aggregation for Deeply Nested Queries	2022	SIGMOD	4.240067e-05
9,945	AB-tree: Index for Concurrent Random Sampling and Updates	2022	VLDB	4.240067e-05
9,946	Distributed Wavelet Thresholding for Maximum Error Metrics	2016	SIGMOD	4.240067e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
5,867	ABS: a System for Scalable Approximate Queries with Accuracy Guarantees	2014	SIGMOD	5.2933639e-05
2,603	DAQ: A New Paradigm for Approximate Query Processing	2015	VLDB	8.4634633e-05
10,265	AQD: Online Adaptive Query Dispatcher for HTAP Databases	2026	VLDB	4.1905499e-05
6,724	Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing	2021	SIGMOD	4.9449472e-05
1,161	VerdictDB: Universalizing Approximate Query Processing	2018	SIGMOD	0.00013579831
2,424	The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing	2014	SIGMOD	8.8415494e-05
1,867	Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems	2014	SIGMOD	0.00010264932
11,287	Approximate Queries over Concurrent Updates	2023	VLDB	4.1905499e-05
2,583	Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee	2016	SIGMOD	8.4973431e-05
10,349	Efficient Approximate Query Processing with Block Sampling	2025	CIDR	4.1905499e-05