SPARTAN: A Model-Based Semantic Compression System for Massive Data Tables
Summary: SPARTAN stores a subset of attributes and predicts the rest with CaRT models under error bounds. It leverages attribute semantics and data-mining optimization to select predictors, enabling compression and approximate queries from small samples. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Shivnath Babu
- 2. Minos Garofalakis
- 3. Rajeev Rastogi
Incoming Citations (Sorted by Pagerank)
Showing 12 of 12 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,455 | RainForest - A Framework for Fast Decision Tree Construction of Large Datasets | 1998 | VLDB | 0.00011899821 |
| 1,598 | Semantic Compression and Pattern Extraction with Fascicles | 1999 | VLDB | 0.00011202905 |
| 4,685 | PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning | 1998 | VLDB | 5.9994771e-05 |
| 5,020 | Efficient Construction of Regression Trees with Range and Region Splitting | 1997 | VLDB | 5.7552641e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,468 | Comprehensive and Efficient Workload Compression | 2021 | VLDB | 6.1584035e-05 |
| 2,134 | How to Wring a Table Dry: Entropy Compression of Relations and Querying of Compressed Relations | 2006 | VLDB | 9.4741038e-05 |
| 9,595 | High-Ratio Compression for Machine-Generated Data | 2023 | SIGMOD | 4.3194469e-05 |
| 7,429 | CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases | 2022 | SIGMOD | 4.7320139e-05 |
| 1,100 | Query Optimization In Compressed Database Systems | 2001 | SIGMOD | 0.00014072277 |
| 3,497 | A New Compression Method with Fast Searching on Large Databases | 1987 | VLDB | 7.0390264e-05 |
| 3,787 | White-box Compression: Learning and Exploiting Compact Table Representations | 2020 | CIDR | 6.7674374e-05 |
| 3,536 | General purpose database summarization | 2005 | VLDB | 6.9990821e-05 |
| 3,745 | DeepSqueeze: Deep Semantic Compression for Tabular Data | 2020 | SIGMOD | 6.7926132e-05 |
| 9,599 | SPARTAN: Data-Adaptive Symbolic Time-Series Approximation | 2025 | SIGMOD | 4.3177432e-05 |