PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage
Summary: PIDS uses unsupervised pattern inference to decompose string attributes into sub-attrs in columnar store, enabling per-attr encoding and compression near Snappy/Gzip. Pushdown to sub-attrs reduces I/O and comparisons, yielding faster query execution. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Hao Jiang
- 2. Chunwei Liu
- 3. Qi Jin
- 4. John Paparrizos
- 5. Aaron J. Elmore
Incoming Citations (Sorted by Pagerank)
Showing 21 of 21 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,613 | Decomposed Bounded Floats for Fast Compression and Queries | 2021 | VLDB | 8.4503824e-05 |
| 1,598 | Semantic Compression and Pattern Extraction with Fascicles | 1999 | VLDB | 0.00011202905 |
| 9,595 | High-Ratio Compression for Machine-Generated Data | 2023 | SIGMOD | 4.3194469e-05 |
| 2,229 | Self-organizing Tuple Reconstruction in Column-stores | 2009 | SIGMOD | 9.2350274e-05 |
| 5,898 | Column Partition and Permutation for Run Length Encoding in Columnar Databases | 2020 | SIGMOD | 5.2839046e-05 |
| 131 | Integrating Compression and Execution in Column-Oriented Database Systems | 2006 | SIGMOD | 0.0004370331 |
| 1,949 | Positional Update Handling in Column Stores | 2010 | SIGMOD | 9.9864085e-05 |
| 9,665 | Fingerprints for Compressed Columnar Data Search | 2019 | SIGMOD | 4.3082524e-05 |
| 1,100 | Query Optimization In Compressed Database Systems | 2001 | SIGMOD | 0.00014072277 |
| 1,134 | Dictionary-based Order-preserving String Compression for Main Memory Column Stores | 2009 | SIGMOD | 0.00013761456 |