Back to papers
Dumpy: A Compact and Adaptive Index for Large Data Series Collections
Summary: Dumpy is a compact, adaptive multi-ary index for large data-series collections, enabling fast index building and high-accuracy search. By addressing iSAX limitations—proximity-compactness trade-offs and skew—via adaptive node splitting and Dumpy-Fuzzy duplication, it achieves better efficiency, scalability, and accuracy.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 6614
- Venue
- SIGMOD
- Year
- 2023
- Pagerank
- 4.8350023e-05
- Overall Rank
- 7,095 | 50.65%
- DOI
-
10.1145/3588965
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 14 of 14 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 2,324 |
RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search |
2024 |
SIGMOD |
9.0326444e-05 |
| 3,400 |
ELPIS: Graph-Based Similarity Search for Scalable Data Science |
2023 |
VLDB |
7.1405533e-05 |
| 6,376 |
DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search |
2024 |
VLDB |
5.0916875e-05 |
| 7,316 |
Steiner-Hardness: A Query Hardness Measure for Graph-Based ANN Indexes |
2024 |
VLDB |
4.7640297e-05 |
| 7,843 |
Subspace Collision: An Efficient and Accurate Framework for High-dimensional Approximate Nearest Neighbor Search |
2025 |
SIGMOD |
4.6367909e-05 |
| 9,206 |
Odyssey: A Journey in the Land of Distributed Data Series Similarity Search |
2023 |
VLDB |
4.373492e-05 |
| 9,230 |
LeaFi: Data Series Indexes on Steroids with Learned Filters |
2025 |
SIGMOD |
4.3690661e-05 |
| 9,247 |
iEDeaL: A Deep Learning Framework for Detecting Highly Imbalanced Interictal Epileptiform Discharges |
2023 |
VLDB |
4.3690661e-05 |
| 9,291 |
DARTH: Declarative Recall Through Early Termination for Approximate Nearest Neighbor Search |
2026 |
SIGMOD |
4.3619549e-05 |
| 9,822 |
DIDS: Double Indices and Double Summarizations for Fast Similarity Search |
2024 |
VLDB |
4.2757088e-05 |
| 10,711 |
Cracking Vector Search Indexes |
2025 |
VLDB |
4.1945683e-05 |
| 10,833 |
Cardinality Estimation for Similarity Search on High-Dimensional Data Objects: The Impact of Reference Objects |
2025 |
VLDB |
4.1945683e-05 |
| 10,884 |
Representative Time Series Discovery for Data Exploration |
2025 |
VLDB |
4.1945683e-05 |
| 11,022 |
CIVET: Exploring Compact Index for Variable-Length Subsequence Matching on Time Series |
2024 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 20 of 20 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 212 |
Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph |
2019 |
VLDB |
0.00033913475 |
| 562 |
Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search |
2016 |
VLDB |
0.00020091752 |
| 770 |
A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search |
2021 |
VLDB |
0.00016917602 |
| 867 |
SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index |
2015 |
VLDB |
0.00015792021 |
| 1,010 |
HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Spaces |
2018 |
VLDB |
0.00014652858 |
| 1,157 |
A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series |
2013 |
VLDB |
0.00013610658 |
| 1,161 |
Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures |
2008 |
VLDB |
0.00013585236 |
| 2,435 |
iDEC: Indexable Distance Estimating Codes for Approximate Nearest Neighbor Search |
2020 |
VLDB |
8.8252237e-05 |
| 2,644 |
Series2Graph: Graph-based Subsequence Anomaly Detection for Time Series |
2020 |
VLDB |
8.3832357e-05 |
| 3,029 |
A Decade of Progress in Indexing and Mining Large Time Series Databases |
2006 |
VLDB |
7.6803666e-05 |
| 3,183 |
Return of the Lernaean Hydra: Experimental Evaluation of Data Series Approximate Similarity Search |
2020 |
VLDB |
7.4228241e-05 |
| 3,400 |
ELPIS: Graph-Based Similarity Search for Scalable Data Science |
2023 |
VLDB |
7.1405533e-05 |
| 3,540 |
Scalable, Variable-Length Similarity Search in Data Series: The ULISSE Approach |
2018 |
VLDB |
6.9943185e-05 |
| 3,629 |
The Lernaean Hydra of Data Series Similarity Search: An Experimental Evaluation of the State of the Art |
2019 |
VLDB |
6.902069e-05 |
| 4,755 |
Indexing for Interactive Exploration of Big Data Series |
2014 |
SIGMOD |
5.946863e-05 |
| 5,158 |
Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes |
2018 |
VLDB |
5.6588553e-05 |
| 5,738 |
Hercules Against Data Series Similarity Search |
2022 |
VLDB |
5.3478528e-05 |
| 8,712 |
ANN Softmax: Acceleration of Extreme Classification Training |
2022 |
VLDB |
4.4626362e-05 |
| 9,206 |
Odyssey: A Journey in the Land of Distributed Data Series Similarity Search |
2023 |
VLDB |
4.373492e-05 |
| 9,247 |
iEDeaL: A Deep Learning Framework for Detecting Highly Imbalanced Interictal Epileptiform Discharges |
2023 |
VLDB |
4.3690661e-05 |
Semantically Similar Papers