Database Paper Browser

Back to papers

PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage

Summary: PIDS uses unsupervised pattern inference to decompose string attributes into sub-attrs in columnar store, enabling per-attr encoding and compression near Snappy/Gzip. Pushdown to sub-attrs reduces I/O and comparisons, yielding faster query execution. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12284
Venue
VLDB
Year
2020
Pagerank
4.5897316e-05
Overall Rank
8,088 | 43.74%
DOI
10.14778/3380750.3380761

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 21 of 21 citing papers.

Rank Citing Paper Year Venue Pagerank
2,381 TSB-UAD: An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection 2022 VLDB 8.9327638e-05
2,613 Decomposed Bounded Floats for Fast Compression and Queries 2021 VLDB 8.4503824e-05
3,416 LeCo: Lightweight Compression via Learning Serial Correlations 2024 SIGMOD 7.1196234e-05
3,943 Volume Under the Surface: A New Accuracy Evaluation Measure for Time-Series Anomaly Detection 2022 VLDB 6.6099833e-05
4,079 Choose Wisely: An Extensive Evaluation of Model Selection for Anomaly Detection in Time Series 2023 VLDB 6.4663636e-05
5,562 A Deep Dive into Common Open Formats for Analytical DBMSs 2023 VLDB 5.4331334e-05
6,311 VergeDB: A Database for IoT Analytics on Edge Devices 2021 CIDR 5.1161316e-05
6,367 Good to the Last Bit: Data-Driven Encoding with CodecDB 2021 SIGMOD 5.0941072e-05
9,294 Theseus: Navigating the Labyrinth of Time-Series Anomaly Detection 2022 VLDB 4.3608061e-05
9,329 Odyssey: An Engine Enabling The Time-Series Clustering Journey 2023 VLDB 4.3556432e-05
9,595 High-Ratio Compression for Machine-Generated Data 2023 SIGMOD 4.3194469e-05
9,599 SPARTAN: Data-Adaptive Symbolic Time-Series Approximation 2025 SIGMOD 4.3177432e-05
9,645 The FastLanes File Format 2025 VLDB 4.3109001e-05
10,466 A Structured Study of Multivariate Time-Series Distance Measures 2025 SIGMOD 4.1945683e-05
10,524 Understanding the Black Box: A Deep Empirical Dive into Shapley Value Approximations for Tabular Data 2025 SIGMOD 4.1945683e-05
10,674 Improving Time Series Data Compression in Apache IoTDB 2025 VLDB 4.1945683e-05
10,738 TSB-AutoAD: Towards Automated Solutions for Time-Series Anomaly Detection 2025 VLDB 4.1945683e-05
10,739 Time-Series Clustering: A Comprehensive Study of Data Mining, Machine Learning, and Deep Learning Methods 2025 VLDB 4.1945683e-05
10,741 Beyond Compression: A Comprehensive Evaluation of Lossless Floating-Point Compression 2025 VLDB 4.1945683e-05
11,094 Time-Series Anomaly Detection: Overview and New Trends 2024 VLDB 4.1945683e-05
11,235 Accelerating Similarity Search for Elastic Measures: A Study and New Generalization of Lower Bounding Distances 2023 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 12 of 12 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers