Decomposed Bounded Floats for Fast Compression and Queries
Summary: BUFF introduces a decomposed columnar encoding for bounded, low-precision floating-point data that delivers strong compression for data-intensive workloads. The decomposed representation also supports fast ingestion and SIMD-accelerated, in-situ adaptive queries, enabling high-speed analytics on such data.
(summarized by gpt-5-nano on Feb 09 2026)
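To make the idea of a decomposed encoding more concrete, here is a minimal Python sketch. It is not the paper's implementation or bit layout: it assumes values are bounded and have a known number of decimal digits, quantizes them to fixed-point offsets from the minimum, and stores each byte of the offset in its own column. The function names (`encode`, `decode`, `filter_greater`) and the decimal scaling are assumptions made for illustration, and the scalar loop stands in for what would be a SIMD scan in practice.

```python
def encode(values, precision_digits):
    """Quantize to fixed-point, subtract the minimum, split into byte columns.

    Assumes `values` is non-empty and each value has at most
    `precision_digits` decimal digits (hypothetical parameters).
    """
    scale = 10 ** precision_digits
    fixed = [round(v * scale) for v in values]
    base = min(fixed)
    offsets = [f - base for f in fixed]
    # Bounded range => only a few bits are needed per value.
    width_bits = max(1, max(offsets).bit_length())
    n_bytes = (width_bits + 7) // 8
    # Column 0 holds the most significant byte of every offset.
    columns = [
        bytes((o >> (8 * (n_bytes - 1 - i))) & 0xFF for o in offsets)
        for i in range(n_bytes)
    ]
    return {"base": base, "scale": scale, "columns": columns}


def decode(enc):
    """Reassemble each offset from its byte columns and undo the scaling."""
    out = []
    for row in range(len(enc["columns"][0])):
        o = 0
        for col in enc["columns"]:
            o = (o << 8) | col[row]
        out.append((o + enc["base"]) / enc["scale"])
    return out


def filter_greater(enc, threshold):
    """Return row indices whose value is > threshold without decoding.

    Scans the most significant byte column first and only looks at lower
    columns for rows that tie so far. Assumes `threshold` has the same
    decimal precision as the data.
    """
    cols = enc["columns"]
    n_bytes = len(cols)
    n_rows = len(cols[0])
    t = round(threshold * enc["scale"]) - enc["base"]
    if t < 0:
        return list(range(n_rows))      # threshold below every value
    if t >= 1 << (8 * n_bytes):
        return []                       # threshold above representable range
    undecided = range(n_rows)
    hits = []
    for i, col in enumerate(cols):
        tb = (t >> (8 * (n_bytes - 1 - i))) & 0xFF
        still_tied = []
        for row in undecided:
            if col[row] > tb:
                hits.append(row)        # decided: greater
            elif col[row] == tb:
                still_tied.append(row)  # need the next column
        undecided = still_tied
        if not undecided:
            break                       # early exit: lower columns untouched
    return sorted(hits)
```

For example, encoding `[20.5, 21.25, 19.75, 300.0, 20.5]` with two decimal digits yields two byte columns, and `filter_greater(enc, 20.5)` returns `[1, 3]`. Most rows are resolved from the first column alone, which is the property that makes such a layout attractive for scans.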
- Paper ID: 12433
- Venue: VLDB
- Year: 2021
- Pagerank: 8.4503824e-05
- Overall Rank: 2,613 | 81.83%
- DOI: 10.14778/3476249.3476305
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 27 of 27 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| 2,381 | TSB-UAD: An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection | 2022 | VLDB | 8.9327638e-05 |
| 3,943 | Volume Under the Surface: A New Accuracy Evaluation Measure for Time-Series Anomaly Detection | 2022 | VLDB | 6.6099833e-05 |
| 4,079 | Choose Wisely: An Extensive Evaluation of Model Selection for Anomaly Detection in Time Series | 2023 | VLDB | 6.4663636e-05 |
| 4,392 | Elf: Erasing-based Lossless Floating-Point Compression | 2023 | VLDB | 6.2257087e-05 |
| 5,562 | A Deep Dive into Common Open Formats for Analytical DBMSs | 2023 | VLDB | 5.4331334e-05 |
| 6,859 | Frequency Domain Data Encoding in Apache IoTDB | 2023 | VLDB | 4.905867e-05 |
| 7,395 | MOST: Model-Based Compression with Outlier Storage for Time Series Data | 2023 | SIGMOD | 4.7420041e-05 |
| 8,373 | Hierarchical Residual Encoding for Multiresolution Time Series Compression | 2023 | SIGMOD | 4.5329467e-05 |
| 8,588 | FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data | 2024 | VLDB | 4.4900555e-05 |
| 8,698 | Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask | 2024 | VLDB | 4.4657846e-05 |
| 8,786 | AWARE: Workload-aware, Redundancy-exploiting Linear Algebra | 2023 | SIGMOD | 4.4521262e-05 |
| 9,149 | Serf: Streaming Error-Bounded Floating-Point Compression | 2025 | SIGMOD | 4.3849295e-05 |
| 9,294 | Theseus: Navigating the Labyrinth of Time-Series Anomaly Detection | 2022 | VLDB | 4.3608061e-05 |
| 9,329 | Odyssey: An Engine Enabling The Time-Series Clustering Journey | 2023 | VLDB | 4.3556432e-05 |
| 9,404 | Revisiting B-tree Compression: An Experimental Study | 2024 | SIGMOD | 4.3441378e-05 |
| 10,321 | DeXOR: Enabling xor in Decimal Space for Streaming Lossless Compression of Floating-point Data | 2026 | VLDB | 4.1945683e-05 |
| 10,466 | A Structured Study of Multivariate Time-Series Distance Measures | 2025 | SIGMOD | 4.1945683e-05 |
| 10,524 | Understanding the Black Box: A Deep Empirical Dive into Shapley Value Approximations for Tabular Data | 2025 | SIGMOD | 4.1945683e-05 |
| 10,614 | QPET: A Versatile and Portable Quantity-of-Interest-Preservation Framework for Error-Bounded Lossy Compression | 2025 | VLDB | 4.1945683e-05 |
| 10,674 | Improving Time Series Data Compression in Apache IoTDB | 2025 | VLDB | 4.1945683e-05 |
| 10,698 | Not Small Enough? SegPQ: A Learned Approach to Compress Product Quantization Codebooks | 2025 | VLDB | 4.1945683e-05 |
| 10,738 | TSB-AutoAD: Towards Automated Solutions for Time-Series Anomaly Detection | 2025 | VLDB | 4.1945683e-05 |
| 10,739 | Time-Series Clustering: A Comprehensive Study of Data Mining, Machine Learning, and Deep Learning Methods | 2025 | VLDB | 4.1945683e-05 |
| 10,741 | Beyond Compression: A Comprehensive Evaluation of Lossless Floating-Point Compression | 2025 | VLDB | 4.1945683e-05 |
| 10,854 | LiquidCache: Efficient Pushdown Caching for Cloud-Native Data Analytics | 2025 | VLDB | 4.1945683e-05 |
| 11,094 | Time-Series Anomaly Detection: Overview and New Trends | 2024 | VLDB | 4.1945683e-05 |
| 11,235 | Accelerating Similarity Search for Elastic Measures: A Study and New Generalization of Lower Bounding Distances | 2023 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| 210 | Gorilla: A Fast, Scalable, In-Memory Time Series Database | 2015 | VLDB | 0.0003404384 |
| 305 | SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units | 2009 | VLDB | 0.00028248614 |
| 426 | Amazon Redshift and the Case for Simpler Data Warehouses | 2015 | SIGMOD | 0.00023594359 |
| 858 | Efficient Transaction Processing in SAP HANA Database – The End of a Column Store Myth | 2012 | SIGMOD | 0.000158756 |
| 958 | Rethinking SIMD Vectorization for In-Memory Databases | 2015 | SIGMOD | 0.00015045316 |
| 1,270 | BitWeaving: Fast Scans for Main Memory Data Processing | 2013 | SIGMOD | 0.00012926086 |
| 1,516 | k-Shape: Efficient and Accurate Clustering of Time Series | 2015 | SIGMOD | 0.00011586255 |
| 2,029 | SAND: Streaming Subsequence Anomaly Detection | 2021 | VLDB | 9.740868e-05 |
| 2,390 | ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout | 2015 | SIGMOD | 8.9084657e-05 |
| 2,616 | DAQ: A New Paradigm for Approximate Query Processing | 2015 | VLDB | 8.4471955e-05 |
| 3,608 | Column Sketches: A Scan Accelerator for Rapid and Robust Predicate Evaluation | 2018 | SIGMOD | 6.924272e-05 |
| 4,059 | GRAIL: Efficient Time-Series Representation Learning | 2019 | VLDB | 6.4854417e-05 |
| 4,853 | Debunking Four Long-Standing Misconceptions of Time-Series Distance Measures | 2020 | SIGMOD | 5.8760276e-05 |
| 5,123 | Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning | 2019 | VLDB | 5.6796998e-05 |
| 6,311 | VergeDB: A Database for IoT Analytics on Edge Devices | 2021 | CIDR | 5.1161316e-05 |
| 8,088 | PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage | 2020 | VLDB | 4.5897316e-05 |
Semantically Similar Papers