On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB
Summary: Shows out‑of‑order inserts, updates and overlapping bitmaps in multi‑column time‑series worsen LSM-tree space amplification in Apache IoTDB and proves optimal compaction file selection is hard. Proposes Multi‑Column Compaction: a heuristic file‑selection with File Prefetcher and Compaction Cache, implemented in IoTDB, that significantly reduces space amplification in experiments. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Chenguang Fang
- 2. Zijie Chen
- 3. Shaoxu Song
- 4. Xiangdong Huang
- 5. Chen Wang
- 6. Jianmin Wang
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,967 | Apache IoTDB: A Time Series Database for IoT Applications | 2023 | SIGMOD | 6.5796647e-05 |
| 10,379 | In-Database Time Series Clustering | 2025 | SIGMOD | 4.1945683e-05 |
| 9,386 | Rethinking The Compaction Policies in LSM-trees | 2025 | SIGMOD | 4.3455975e-05 |
| 8,434 | Time Series Representation for Visualization in Apache IoTDB | 2024 | SIGMOD | 4.5141748e-05 |
| 10,575 | Migration-Free Elastic Storage of Time Series in Apache IoTDB | 2025 | VLDB | 4.1945683e-05 |
| 5,071 | Time Series Data Encoding for Efficient Storage: A Comparative Analysis in Apache IoTDB | 2022 | VLDB | 5.7188461e-05 |
| 3,793 | Constructing and Analyzing the LSM Compaction Design Space | 2021 | VLDB | 6.7617833e-05 |
| 9,794 | Distance-based Outlier Query Optimization in Apache IoTDB | 2024 | VLDB | 4.2818172e-05 |
| 11,175 | Grouping Time Series for Efficient Columnar Storage | 2023 | SIGMOD | 4.1945683e-05 |
| 10,674 | Improving Time Series Data Compression in Apache IoTDB | 2025 | VLDB | 4.1945683e-05 |