Database Paper Browser

Congressional Samples for Approximate Answering of Group-By Queries

Summary: Proposes congressional samples, a hybrid of uniform and biased samples, to maximize the accuracy of group-by query answers under a fixed space budget. The samples are built in one pass and maintained incrementally without accessing the base relation; the paper also presents query-rewriting strategies and validates the approach on TPC-D. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 3209
Venue: SIGMOD
Year: 2000
Pagerank: 0.00017401518
Overall Rank: 739 | 94.87%
DOI: -

[Chart: Incoming Non-self Citations Over Time]

Authors

Incoming Citations (Sorted by Pagerank)

Showing 48 of 48 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
43 | Models and Issues in Data Stream Systems | 2002 | PODS | 0.00072723062
308 | Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports | 2001 | VLDB | 0.00028142852
429 | The Aqua Approximate Query Answering System | 1999 | SIGMOD | 0.00023476494
449 | Approximate Query Processing: Taming the TeraBytes! A Tutorial | 2001 | VLDB | 0.00022846068
549 | Tracking Join and Self-Join Sizes in Limited Storage | 1999 | PODS | 0.00020376603
967 | Aqua: A Fast Decision Support System Using Approximate Query Answers | 1999 | VLDB | 0.00014959939
1,260 | Dynamic Sample Selection for Approximate Query Processing | 2003 | SIGMOD | 0.00012993347
1,323 | Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters | 2016 | SIGMOD | 0.00012601997
1,350 | Northstar: An Interactive Data Science System | 2018 | VLDB | 0.00012431059
1,443 | Compressing SQL Workloads | 2002 | SIGMOD | 0.00011947004
1,574 | Approximate Query Processing: No Silver Bullet | 2017 | SIGMOD | 0.00011287495
2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05
2,184 | A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data | 2014 | SIGMOD | 9.3429789e-05
2,203 | Independent Range Sampling | 2014 | PODS | 9.2981095e-05
2,368 | Online Maintenance of Very Large Random Samples | 2004 | SIGMOD | 8.9501526e-05
2,580 | Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee | 2016 | SIGMOD | 8.5058814e-05
2,662 | Dwarf: Shrinking the PetaCube | 2002 | SIGMOD | 8.3532302e-05
2,716 | Davos: A System for Interactive Data-Driven Decision Making | 2021 | VLDB | 8.2429172e-05
2,808 | A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries | 2001 | SIGMOD | 8.0870741e-05
2,852 | MRI: Meaningful Interpretations of Collaborative Ratings | 2011 | VLDB | 8.0151391e-05
3,441 | Interactive Data Exploration Using Semantic Windows | 2014 | SIGMOD | 7.0914601e-05
3,594 | Continuous Sampling for Online Aggregation Over Multiple Queries | 2010 | SIGMOD | 6.9381343e-05
3,842 | Turbo-Charging Estimate Convergence in DBO | 2009 | VLDB | 6.7102374e-05
3,944 | AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics | 2018 | SIGMOD | 6.6078243e-05
4,014 | Exploiting Correlations for Expensive Predicate Evaluation | 2015 | SIGMOD | 6.5273084e-05
4,030 | Revisiting Reuse for Approximate Query Processing | 2017 | VLDB | 6.5129665e-05
4,287 | Primitives for Workload Summarization and Implications for SQL | 2003 | VLDB | 6.2891702e-05
4,435 | Sampling Dirty Data for Matching Attributes | 2010 | SIGMOD | 6.1918164e-05
4,681 | Adaptive Sampling for Rapidly Matching Histograms | 2018 | VLDB | 6.0034918e-05
5,214 | ThalamusDB: Approximate Query Processing on Multi-Modal Data | 2024 | SIGMOD | 5.624434e-05
5,252 | Error-bounded Sampling for Analytics on Big Sparse Data | 2014 | VLDB | 5.6024389e-05
5,539 | Supporting Time-Constrained SQL Queries in Oracle | 2007 | VLDB | 5.4503121e-05
5,581 | CliffGuard: A Principled Framework for Finding Robust Database Designs | 2015 | SIGMOD | 5.424205e-05
5,817 | Derby/S: A DBMS for Sample-Based Query Answering | 2006 | SIGMOD | 5.3156799e-05
6,491 | Robust Estimation With Sampling and Approximate Pre-Aggregation | 2003 | VLDB | 5.0429323e-05
6,548 | Query Sampling in DB2 Universal Database | 2004 | SIGMOD | 5.0181595e-05
6,740 | Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing | 2021 | SIGMOD | 4.944395e-05
7,081 | The Polynomial Complexity of Fully Materialized Coalesced Cubes | 2004 | VLDB | 4.8413336e-05
7,534 | Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams | 2022 | VLDB | 4.7180004e-05
7,714 | Identifying Insufficient Data Coverage in Databases with Multiple Relations | 2020 | VLDB | 4.6700455e-05
8,643 | One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees | 2022 | SIGMOD | 4.4777916e-05
8,715 | Data Driven Approximation with Bounded Resources | 2017 | VLDB | 4.4619052e-05
9,621 | ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation | 2023 | VLDB | 4.3167167e-05
10,227 | Sample-based Distinct Cardinality Estimation for Multiple Attributes in Multi-Dataset Queries | 2026 | VLDB | 4.1945683e-05
10,481 | FAAQP: Fast and Accurate Approximate Query Processing based on Bitmap-augmented Sum-Product Network | 2025 | SIGMOD | 4.1945683e-05
10,497 | PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees | 2025 | SIGMOD | 4.1945683e-05
11,539 | FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data | 2021 | VLDB | 4.1945683e-05
12,626 | Estimating the Output Cardinality of Partial Preaggregation with a Measure of Clusteredness | 2003 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers