Database Paper Browser

Back to papers

AWARE: Workload-aware, Redundancy-exploiting Linear Algebra

Summary: Introduces AWARE, a workload-aware compression framework for ML pipelines that summarizes their workload and optimizes compression plus execution plans to minimize runtime. It exploits redundancy beyond sparsity with new schemes and kernels, delivering up to 10,000x per-op and 6.6x ML gains over uncompressed baselines. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6505
Venue
SIGMOD
Year
2023
Pagerank
4.4521262e-05
Overall Rank
8,786 | 38.88%
DOI
10.1145/3588682

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 38 of 38 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
131 Integrating Compression and Execution in Column-Oriented Database Systems 2006 SIGMOD 0.0004370331
241 DB2 with BLU Acceleration: So Much More than Just a Column Store 2013 VLDB 0.00031420034
305 SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units 2009 VLDB 0.00028248614
368 Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing 1998 VLDB 0.000254931
557 SystemML: Declarative Machine Learning on Spark 2016 VLDB 0.00020197988
734 The TileDB Array Data Storage Manager 2017 VLDB 0.00017455248
834 Learning Linear Regression Models over Factorized Joins 2016 SIGMOD 0.00016135159
1,134 Dictionary-based Order-preserving String Compression for Main Memory Column Stores 2009 SIGMOD 0.00013761456
1,167 Learning Generalized Linear Models Over Normalized Data 2015 SIGMOD 0.00013547713
1,263 Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation 2016 SIGMOD 0.00012982857
1,279 Towards Linear Algebra over Normalized Data 2017 VLDB 0.00012868394
1,590 Column-oriented Database Systems 2009 VLDB 0.00011233838
1,611 Qd-tree: Learning Data Layouts for Big Data Analytics 2020 SIGMOD 0.00011147324
1,905 How to Barter Bits for Chronons: Compression and Bandwidth Trade Offs for Database Scans 2007 SIGMOD 0.00010138448
1,967 Compressed Linear Algebra for Large-Scale Machine Learning 2016 VLDB 9.9131712e-05
2,064 Chimp: Efficient Lossless Floating Point Compression for Time Series Databases 2022 VLDB 9.6418929e-05
2,134 How to Wring a Table Dry: Entropy Compression of Relations and Querying of Compressed Relations 2006 VLDB 9.4741038e-05
2,456 Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities 2021 SIGMOD 8.7733773e-05
2,613 Decomposed Bounded Floats for Fast Compression and Queries 2021 VLDB 8.4503824e-05
3,277 A Layered Aggregate Engine for Analytics Workloads 2019 SIGMOD 7.2871625e-05
3,488 Optimal Column Layout for Hybrid Workloads 2019 VLDB 7.0479329e-05
3,745 DeepSqueeze: Deep Semantic Compression for Tabular Data 2020 SIGMOD 6.7926132e-05
3,798 Plato: Approximate Analytics over Compressed Time Series with Tight Deterministic Error Guarantees 2020 VLDB 6.7592302e-05
3,808 SketchML: Accelerating Distributed Machine Learning with Data Sketches 2018 SIGMOD 6.7455428e-05
3,918 On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML 2018 VLDB 6.6315176e-05
4,787 The Relational Data Borg is Learning 2020 VLDB 5.9224501e-05
4,833 MNC: Structure-Exploiting Sparsity Estimation for Matrix Expressions 2019 SIGMOD 5.8916346e-05
5,123 Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning 2019 VLDB 5.6796998e-05
5,806 BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees 2019 SIGMOD 5.3200643e-05
6,057 Progressive Compressed Records: Taking a Byte out of Deep Learning Data 2021 VLDB 5.2317752e-05
6,157 Compression Aware Physical Database Design 2011 VLDB 5.1801143e-05
6,191 Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra 2021 SIGMOD 5.1642282e-05
6,538 Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent 2019 SIGMOD 5.023239e-05
7,335 MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model 2020 VLDB 4.7603723e-05
7,704 ExDRa: Exploratory Data Science on Federated Raw Data 2021 SIGMOD 4.6733838e-05
8,578 Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems 2022 VLDB 4.4923477e-05
8,657 Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices 2022 VLDB 4.4730648e-05
9,265 COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression 2022 VLDB 4.3667558e-05
Previous Page 1 / 1 Next

Semantically Similar Papers