Database Paper Browser

Back to papers

Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing

Summary: Small Materialized Aggregates (SMAs) are a light-weight, flexible index for data warehousing, precomputing many aggregates over small to medium buckets. Applied to TPC-D, SMAs deliver about 100x speedups on Query 1 and offer tunable strategies for efficient query processing. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8512
Venue
VLDB
Year
1998
Pagerank
0.000254931
Overall Rank
368 | 97.45%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 64 citing papers.

Rank Citing Paper Year Venue Pagerank
35 MonetDB/X100: Hyper-Pipelining Query Execution 2005 CIDR 0.00076197749
102 The Case for Learned Index Structures 2018 SIGMOD 0.00049545203
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
310 The Vertica Analytic Database: C-Store 7 Years Later 2012 VLDB 0.00028132402
426 Amazon Redshift and the Case for Simpler Data Warehouses 2015 SIGMOD 0.00023594359
1,026 Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS 2007 VLDB 0.00014589172
1,263 Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation 2016 SIGMOD 0.00012982857
1,375 FITing-Tree: A Data-aware Index Structure 2019 SIGMOD 0.00012303141
1,470 Processing a Trillion Cells per Mouse Click 2012 VLDB 0.00011833779
1,477 Fine-grained Partitioning for Aggressive Data Skipping 2014 SIGMOD 0.00011770865
1,611 Qd-tree: Learning Data Layouts for Big Data Analytics 2020 SIGMOD 0.00011147324
1,913 BF-Tree: Approximate Tree Indexing 2014 VLDB 0.00010113937
1,949 Positional Update Handling in Column Stores 2010 SIGMOD 9.9864085e-05
2,322 Instant Loading for Main Memory Databases 2013 VLDB 9.034874e-05
2,916 Quantifying TPC-H Choke Points and Their Optimizations 2020 VLDB 7.9068048e-05
3,488 Optimal Column Layout for Hybrid Workloads 2019 VLDB 7.0479329e-05
3,608 Column Sketches: A Scan Accelerator for Rapid and Robust Predicate Evaluation 2018 SIGMOD 6.924272e-05
3,737 Skipping-oriented Partitioning for Columnar Layouts 2017 VLDB 6.8033227e-05
3,878 Data Canopy: Accelerating Exploratory Statistical Analysis 2017 SIGMOD 6.6731435e-05
3,891 Slalom: Coasting Through Raw Data via Adaptive Partitioning and Indexing 2017 VLDB 6.659442e-05
3,912 Two Birds, One Stone: A Fast, yet Lightweight, Indexing Scheme for Modern Database Systems 2017 VLDB 6.6354964e-05
3,944 AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics 2018 SIGMOD 6.6078243e-05
4,158 Performance-Optimal Filtering: Bloom Overtakes Cuckoo at High Throughput 2019 VLDB 6.3994318e-05
4,161 Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe? 2017 SIGMOD 6.3938006e-05
4,390 LogStore: A Cloud-Native and Multi-Tenant Log Database 2021 SIGMOD 6.2279149e-05
4,495 ClickHouse - Lightning Fast Analytics for Everyone 2024 VLDB 6.1410277e-05
4,530 Big Metadata: When Metadata is Big Data 2021 VLDB 6.1075429e-05
4,717 Cloud Analytics Benchmark 2023 VLDB 5.9751539e-05
4,956 Dimensions Based Data Clustering and Zone Maps 2017 VLDB 5.8040891e-05
5,113 Columnstore and B+ tree – Are Hybrid Physical Designs Important? 2018 SIGMOD 5.687445e-05
5,119 Design Tradeoffs of Data Access Methods 2016 SIGMOD 5.6807904e-05
5,315 Cuckoo Index: A Lightweight Secondary Index Structure 2020 VLDB 5.5723424e-05
5,749 BinDex: A Two-Layered Index for Fast and Robust Scans 2020 SIGMOD 5.3418923e-05
5,790 AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data 2015 VLDB 5.3269734e-05
5,791 Dissecting, Designing, and Optimizing LSM-based Data Stores 2022 SIGMOD 5.3268999e-05
6,340 Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine 2024 SIGMOD 5.1051018e-05
6,466 Pando: Enhanced Data Skipping with Logical Data Partitioning 2023 VLDB 5.0528281e-05
6,972 Predicate Caching: Query-Driven Secondary Indexing for Cloud Data Warehouses 2024 SIGMOD 4.8785237e-05
7,053 Statisticum: Data Statistics Management in SAP HANA 2017 VLDB 4.8497195e-05
7,483 RTScan: Efficient Scan with Ray Tracing Cores 2024 VLDB 4.7180617e-05
7,876 Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP 2024 VLDB 4.6298182e-05
7,907 Petabyte-Scale Row-Level Operations in Data Lakehouses 2024 VLDB 4.6205839e-05
8,222 Sieve: A Learned Data-Skipping Index for Data Analytics 2023 VLDB 4.5555621e-05
8,225 Automated Multidimensional Data Layouts in Amazon Redshift 2024 SIGMOD 4.555289e-05
8,415 Pruning in Snowflake: Working Smarter, Not Harder 2025 SIGMOD 4.5197687e-05
8,430 Tree-Encoded Bitmaps 2020 SIGMOD 4.5154973e-05
8,447 Cabin: a Compressed Adaptive Binned Scan Index 2024 SIGMOD 4.5102052e-05
8,487 Adaptive Compression for Fast Scans on String Columns 2021 SIGMOD 4.4999394e-05
8,718 Parachute: Single-Pass Bi-Directional Information Passing 2025 VLDB 4.4612599e-05
8,758 Hyperspace: The Indexing Subsystem of Azure Synapse 2021 VLDB 4.456315e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers