DeepSqueeze: Deep Semantic Compression for Tabular Data
Summary: DeepSqueeze uses autoencoders to map tabular rows to a compact latent space, capturing cross-column dependencies beyond per-column encodings. It provides bounds for lossy numerical compression and works with columnar formats, achieving ~4x reduction. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Amir Ilkhechi
- 2. Andrew Crotty
- 3. Alex Galakatos
- 4. Yicong Mao
- 5. Grace Fan
- 6. Xiran Shi
- 7. Ugur Cetintemel
Incoming Citations (Sorted by Pagerank)
Showing 16 of 16 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 131 | Integrating Compression and Execution in Column-Oriented Database Systems | 2006 | SIGMOD | 0.0004370331 |
| 310 | The Vertica Analytic Database: C-Store 7 Years Later | 2012 | VLDB | 0.00028132402 |
| 1,598 | Semantic Compression and Pattern Extraction with Fascicles | 1999 | VLDB | 0.00011202905 |
| 2,134 | How to Wring a Table Dry: Entropy Compression of Relations and Querying of Compressed Relations | 2006 | VLDB | 9.4741038e-05 |
| 2,251 | Vizdom: Interactive Analytics through Pen and Touch | 2015 | VLDB | 9.1986441e-05 |
| 2,908 | SPARTAN: A Model-Based Semantic Compression System for Massive Data Tables | 2001 | SIGMOD | 7.9306333e-05 |
| 4,030 | Revisiting Reuse for Approximate Query Processing | 2017 | VLDB | 6.5129665e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,967 | Compressed Linear Algebra for Large-Scale Machine Learning | 2016 | VLDB | 9.9131712e-05 |
| 10,281 | GPU Acceleration of SQL Analytics on Compressed Data | 2026 | VLDB | 4.1945683e-05 |
| 3,335 | DeepJoin: Joinable Table Discovery with Pre-trained Language Models | 2023 | VLDB | 7.2065006e-05 |
| 1,100 | Query Optimization In Compressed Database Systems | 2001 | SIGMOD | 0.00014072277 |
| 3,536 | General purpose database summarization | 2005 | VLDB | 6.9990821e-05 |
| 3,787 | White-box Compression: Learning and Exploiting Compact Table Representations | 2020 | CIDR | 6.7674374e-05 |
| 7,429 | CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases | 2022 | SIGMOD | 4.7320139e-05 |
| 2,908 | SPARTAN: A Model-Based Semantic Compression System for Massive Data Tables | 2001 | SIGMOD | 7.9306333e-05 |
| 6,894 | TableDC: Deep Clustering for Tabular Data | 2025 | SIGMOD | 4.8925595e-05 |
| 9,408 | Experimental Analysis of Large-scale Learnable Vector Storage Compression | 2024 | VLDB | 4.3441378e-05 |