Database Paper Browser

Back to papers

Beyond Compression: A Comprehensive Evaluation of Lossless Floating-Point Compression

Summary: Comprehensive empirical evaluation of lossless floating-point compressors for columnar engines, measuring compression ratio, in-situ DB query performance, and ML queries (distance, k-NN/RAG); implemented in Rust and released as a library. Finds clear trade-offs: no single method dominates both space and query speed—some compressors yield higher compression but slower DB/ML query performance. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
14054
Venue
VLDB
Year
2025
Pagerank
4.1945683e-05
Overall Rank
10,741 | 25.28%
DOI
10.14778/3749646.3749701

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 36 of 36 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
131 Integrating Compression and Execution in Column-Oriented Database Systems 2006 SIGMOD 0.0004370331
210 Gorilla: A Fast, Scalable, In-Memory Time Series Database 2015 VLDB 0.0003404384
1,270 BitWeaving: Fast Scans for Main Memory Data Processing 2013 SIGMOD 0.00012926086
1,516 k-Shape: Efficient and Accurate Clustering of Time Series 2015 SIGMOD 0.00011586255
1,590 Column-oriented Database Systems 2009 VLDB 0.00011233838
2,029 SAND: Streaming Subsequence Anomaly Detection 2021 VLDB 9.740868e-05
2,064 Chimp: Efficient Lossless Floating Point Compression for Time Series Databases 2022 VLDB 9.6418929e-05
2,381 TSB-UAD: An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection 2022 VLDB 8.9327638e-05
2,390 ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout 2015 SIGMOD 8.9084657e-05
2,613 Decomposed Bounded Floats for Fast Compression and Queries 2021 VLDB 8.4503824e-05
3,644 BtrBlocks: Efficient Columnar Compression for Data Lakes 2023 SIGMOD 6.8854928e-05
3,943 Volume Under the Surface: A New Accuracy Evaluation Measure for Time-Series Anomaly Detection 2022 VLDB 6.6099833e-05
4,059 GRAIL: Efficient Time-Series Representation Learning 2019 VLDB 6.4854417e-05
4,079 Choose Wisely: An Extensive Evaluation of Model Selection for Anomaly Detection in Time Series 2023 VLDB 6.4663636e-05
4,392 Elf: Erasing-based Lossless Floating-Point Compression 2023 VLDB 6.2257087e-05
4,507 ALP: Adaptive Lossless floating-Point Compression 2023 SIGMOD 6.131017e-05
4,518 The FastLanes Compression Layout: Decoding >100 Billion Integers per Second with Scalar Code 2023 VLDB 6.117844e-05
4,853 Debunking Four Long-Standing Misconceptions of Time-Series Distance Measures 2020 SIGMOD 5.8760276e-05
5,040 Tile-based Lightweight Integer Compression in GPU 2022 SIGMOD 5.7425187e-05
5,562 A Deep Dive into Common Open Formats for Analytical DBMSs 2023 VLDB 5.4331334e-05
6,311 VergeDB: A Database for IoT Analytics on Edge Devices 2021 CIDR 5.1161316e-05
6,367 Good to the Last Bit: Data-Driven Encoding with CodecDB 2021 SIGMOD 5.0941072e-05
7,395 MOST: Model-Based Compression with Outlier Storage for Time Series Data 2023 SIGMOD 4.7420041e-05
7,429 CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases 2022 SIGMOD 4.7320139e-05
8,088 PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage 2020 VLDB 4.5897316e-05
8,588 FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data 2024 VLDB 4.4900555e-05
9,294 Theseus: Navigating the Labyrinth of Time-Series Anomaly Detection 2022 VLDB 4.3608061e-05
9,329 Odyssey: An Engine Enabling The Time-Series Clustering Journey 2023 VLDB 4.3556432e-05
9,599 SPARTAN: Data-Adaptive Symbolic Time-Series Approximation 2025 SIGMOD 4.3177432e-05
10,466 A Structured Study of Multivariate Time-Series Distance Measures 2025 SIGMOD 4.1945683e-05
10,718 BURST: Rendering Clustering Techniques Suitable for Evolving Streams 2025 VLDB 4.1945683e-05
10,738 TSB-AutoAD: Towards Automated Solutions for Time-Series Anomaly Detection 2025 VLDB 4.1945683e-05
10,739 Time-Series Clustering: A Comprehensive Study of Data Mining, Machine Learning, and Deep Learning Methods 2025 VLDB 4.1945683e-05
11,094 Time-Series Anomaly Detection: Overview and New Trends 2024 VLDB 4.1945683e-05
11,235 Accelerating Similarity Search for Elastic Measures: A Study and New Generalization of Lower Bounding Distances 2023 VLDB 4.1945683e-05
13,261 SAND in Action: Subsequence Anomaly Detection for Streams 2021 VLDB -
Previous Page 1 / 1 Next

Semantically Similar Papers