Database Paper Browser

Blink and It's Done: Interactive Queries on Very Large Data

Summary: BlinkDB is a massively parallel, sampling-based approximate query processing framework for interactive SPJA (select-project-join-aggregate) queries on petabyte-scale data. It returns results in real time with statistical error guarantees and runs atop Hive/HDFS. The demonstration reports speedups of up to 150x over Hive on MapReduce and 10-150x over Shark on tens of terabytes of data across ~100 machines, with 2-10% error, in a fault-tolerant, scalable deployment. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10431
Venue: VLDB
Year: 2012
Pagerank: 0.00013645792
Overall Rank: 1,152 | 91.99%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
943 | Wander Join: Online Aggregation via Random Walks | 2016 | SIGMOD | 0.00015145883
1,204 | VerdictDB: Universalizing Approximate Query Processing | 2018 | SIGMOD | 0.00013319541
1,574 | Approximate Query Processing: No Silver Bullet | 2017 | SIGMOD | 0.00011287495
1,805 | M4: A Visualization-Oriented Time Series Data Aggregation | 2014 | VLDB | 0.00010493299
1,840 | dbTouch: Analytics at your Fingertips | 2013 | CIDR | 0.0001034905
1,846 | Combining User Interaction, Speculative Query Execution and Sampling in the DICE System | 2014 | VLDB | 0.00010335419
1,874 | Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems | 2014 | SIGMOD | 0.00010244443
2,588 | Database Learning: Toward a Database that Becomes Smarter Every Time | 2017 | SIGMOD | 8.4909562e-05
2,659 | Multi-Objective Parametric Query Optimization | 2015 | VLDB | 8.3604734e-05
3,070 | Explore-by-Example: An Automatic Query Steering Framework for Interactive Data Exploration | 2014 | SIGMOD | 7.6137064e-05
3,333 | SnappyData: A Unified Cluster for Streaming, Transactions, and Interactive Analytics | 2017 | CIDR | 7.2093479e-05
4,029 | Spatial Online Sampling and Aggregation | 2016 | VLDB | 6.51315e-05
4,167 | Scalable Distributed Stream Join Processing | 2015 | SIGMOD | 6.3919506e-05
4,248 | Hyper Dimension Shuffle: Efficient Data Repartition at Petabyte Scale in SCOPE | 2019 | VLDB | 6.3247927e-05
4,874 | Approximation Schemes for Many-Objective Query Optimization | 2014 | SIGMOD | 5.8594632e-05
5,075 | An Incremental Anytime Algorithm for Multi-Objective Query Optimization | 2015 | SIGMOD | 5.7172118e-05
5,262 | SnappyData: A Hybrid Transactional Analytical Store Built On Spark | 2016 | SIGMOD | 5.5977349e-05
5,581 | CliffGuard: A Principled Framework for Finding Robust Database Designs | 2015 | SIGMOD | 5.424205e-05
6,493 | Joins on Samples: A Theoretical Guide for Practitioners | 2020 | VLDB | 5.0424713e-05
6,907 | Continuous Prefetch for Interactive Data Applications | 2020 | VLDB | 4.8925595e-05
7,339 | SpareLLM: Automatically Selecting Task-Specific Minimum-Cost Large Language Models under Equivalence Constraint | 2025 | SIGMOD | 4.7579469e-05
7,759 | Dscaler: Synthetically Scaling A Given Relational Database | 2016 | VLDB | 4.6593145e-05
8,689 | Wander Join: Online Aggregation for Joins | 2016 | SIGMOD | 4.4667389e-05
8,725 | A Fast Randomized Algorithm for Multi-Objective Query Optimization | 2016 | SIGMOD | 4.4600243e-05
9,305 | Parallelizing Query Optimization on Shared-Nothing Architectures | 2016 | VLDB | 4.3577129e-05
9,630 | Ziggy: Characterizing Query Results for Data Explorers | 2016 | VLDB | 4.3138911e-05
11,711 | Demonstration of VerdictDB, the Platform-Independent AQP System | 2018 | SIGMOD | 4.1945683e-05
11,913 | STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data | 2015 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers