Database Paper Browser

Back to papers

Data Canopy: Accelerating Exploratory Statistical Analysis

Summary: Data Canopy provides an in-memory library of basic aggregates to reuse statistics across overlapping data parts, cutting recomputation in exploratory analysis. It decomposes stats into reusable aggregates, with storage/maintenance and hardware-aware tuning, yielding ~10x speedup after 100 queries vs. state-of-the-art. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5427
Venue
SIGMOD
Year
2017
Pagerank
6.6731435e-05
Overall Rank
3,878 | 73.03%
DOI
10.1145/3035918.3064051

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 41 of 41 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
55 Efficiently Updating Materialized Views 1986 SIGMOD 0.00065762967
269 Fast Incremental Maintenance of Approximate Histograms 1997 VLDB 0.00029656549
366 An Array-Based Algorithm for Simultaneous Multidimensional Aggregates 1997 SIGMOD 0.0002552977
368 Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing 1998 VLDB 0.000254931
414 Timber: A Sophisticated Relation Browser 1982 VLDB 0.0002388204
472 Bottom-Up Computation of Sparse and Iceberg CUBEs 1999 SIGMOD 0.00022346384
481 Incremental Maintenance of Views with Duplicates 1995 SIGMOD 0.00022167223
516 AutoAdmin "What-if" Index Analysis Utility 1998 SIGMOD 0.00021196031
530 Random Sampling for Histogram Construction: How much is enough? 1998 SIGMOD 0.00020803682
591 TelegraphCQ: Continuous Dataflow Processing 2003 SIGMOD 0.00019569071
703 Query Execution Techniques for Caching Expensive Methods 1996 SIGMOD 0.00017916705
785 StatStream: Statistical Monitoring of Thousands of Data Streams in Real Time 2002 VLDB 0.00016664156
834 Learning Linear Regression Models over Factorized Joins 2016 SIGMOD 0.00016135159
962 Maintenance of Data Cubes and Summary Tables in a Warehouse 1997 SIGMOD 0.00014986226
1,098 Trill: A High-Performance Incremental Query Processor for Diverse Analytics 2015 VLDB 0.00014114442
1,509 Discovering Queries based on Example Tuples 2014 SIGMOD 0.00011612727
1,552 Overview of Data Exploration Techniques 2015 SIGMOD 0.00011408814
1,587 Dynamic Prefetching of Data Tiles for Interactive Visualization 2016 SIGMOD 0.00011245116
1,786 Fast Approximate Correlation for Massive Time-series Data 2010 SIGMOD 0.00010558719
1,797 Effective Use of Block-Level Sampling in Statistics Estimation 2004 SIGMOD 0.00010523169
1,887 Caching Multidimensional Queries Using Chunks 1998 SIGMOD 0.00010204659
1,909 SciBORQ: Scientific data management with Bounds On Runtime and Quality 2011 CIDR 0.00010121304
1,918 VizDeck: Self-Organizing Dashboards for Visual Analytics 2012 SIGMOD 0.00010097599
2,190 Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration 2003 VLDB 9.3317645e-05
2,415 VizQL: A Language for Query, Analysis and Visualization 2006 SIGMOD 8.8639497e-05
2,590 Answering Queries from Statistics and Probabilistic Views 2005 VLDB 8.483194e-05
2,610 i3: Intelligent, Interactive Investigation of OLAP data cubes 2000 SIGMOD 8.4571036e-05
2,667 Cumulon: Optimizing Statistical Data Analysis in the Cloud 2013 SIGMOD 8.3413995e-05
2,733 The Case for Data Visualization Management Systems [Vision Paper] 2014 VLDB 8.2078862e-05
2,930 Assessing and Ranking Structural Correlations in Graphs 2011 SIGMOD 7.8723983e-05
2,965 SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment 2016 SIGMOD 7.8059273e-05
3,070 Explore-by-Example: An Automatic Query Steering Framework for Interactive Data Exploration 2014 SIGMOD 7.6137064e-05
3,594 Continuous Sampling for Online Aggregation Over Multiple Queries 2010 SIGMOD 6.9381343e-05
4,599 Playful Query Specification with DataPlay 2012 VLDB 6.0583418e-05
4,834 Querying Without Keyboards 2013 CIDR 5.8912496e-05
5,987 Sampling Cube: A Framework for Statistical OLAP Over Sampling Data 2008 SIGMOD 5.2432535e-05
7,921 Information Retrieval from an Incomplete Data Cube 1996 VLDB 4.6161463e-05
8,507 ARCube: Supporting Ranking Aggregate Queries in Partially Materialized Data Cubes 2008 SIGMOD 4.4955397e-05
9,350 Tracking Set Correlations at Large Scale 2014 SIGMOD 4.3525655e-05
9,818 Structures, Semantics and Statistics 2004 VLDB 4.2777808e-05
Previous Page 1 / 1 Next

Semantically Similar Papers