Back to papers
Grouping Time Series for Efficient Columnar Storage
Summary: Groups time series by sharing a single time column to reduce timestamp repetition, at the expense of potential nulls. NP-hard to optimize; propose heuristic grouping, deployed in Apache IoTDB, with storage comparable to single-column schemes.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 6526
- Venue
- SIGMOD
- Year
- 2023
- Pagerank
- 4.1945683e-05
- Overall Rank
- 11,175 | 22.26%
- DOI
-
10.1145/3588703
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 210 |
Gorilla: A Fast, Scalable, In-Memory Time Series Database |
2015 |
VLDB |
0.0003404384 |
| 266 |
Efficient Exact Set-Similarity Joins |
2006 |
VLDB |
0.00029718727 |
| 1,396 |
Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search |
2012 |
SIGMOD |
0.00012204748 |
| 1,921 |
Apache IoTDB: Time-series Database for Internet of Things |
2020 |
VLDB |
0.00010082827 |
| 3,208 |
Column-Oriented Storage Techniques for MapReduce |
2011 |
VLDB |
7.3781897e-05 |
| 4,012 |
Columnar Storage and List-based Processing for Graph Database Management Systems |
2021 |
VLDB |
6.5335884e-05 |
| 6,596 |
Heracles: An Efficient Storage Model and Data Flushing for Performance Monitoring Timeseries |
2021 |
VLDB |
4.9988301e-05 |
| 7,112 |
Wide Table Layout Optimization based on Column Ordering and Duplication |
2017 |
SIGMOD |
4.8275068e-05 |
| 7,128 |
Jigsaw: A Data Storage and Query Processing Engine for Irregular Table Partitioning |
2021 |
SIGMOD |
4.8230171e-05 |
| 11,881 |
Cleaning Timestamps with Temporal Constraints |
2016 |
VLDB |
4.1945683e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 7,112 |
Wide Table Layout Optimization based on Column Ordering and Duplication |
2017 |
SIGMOD |
4.8275068e-05 |
| 6,596 |
Heracles: An Efficient Storage Model and Data Flushing for Performance Monitoring Timeseries |
2021 |
VLDB |
4.9988301e-05 |
| 7,168 |
TimeUnion: An Efficient Architecture with Unified Data Model for Timeseries Management Systems on Hybrid Cloud Storage |
2022 |
SIGMOD |
4.8121704e-05 |
| 1,590 |
Column-oriented Database Systems |
2009 |
VLDB |
0.00011233838 |
| 9,048 |
On Repairing Timestamps for Regular Interval Time Series |
2022 |
VLDB |
4.4039656e-05 |
| 5,898 |
Column Partition and Permutation for Run Length Encoding in Columnar Databases |
2020 |
SIGMOD |
5.2839046e-05 |
| 10,674 |
Improving Time Series Data Compression in Apache IoTDB |
2025 |
VLDB |
4.1945683e-05 |
| 10,379 |
In-Database Time Series Clustering |
2025 |
SIGMOD |
4.1945683e-05 |
| 11,049 |
On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB |
2024 |
VLDB |
4.1945683e-05 |
| 5,071 |
Time Series Data Encoding for Efficient Storage: A Comparative Analysis in Apache IoTDB |
2022 |
VLDB |
5.7188461e-05 |