Database Paper Browser

Back to papers

Marigold: Efficient k-means Clustering in High Dimensions

Summary: Marigold accelerates k-means in high dimensions by aggressively pruning distance computations via a tight distance‑bounding scheme, stepwise multiresolution transforms, and triangle‑inequality exploitation. Novel combination yields near real‑time clustering (≈10× speedup on ARPES and other real-world datasets) without degrading k‑means accuracy. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13032
Venue
VLDB
Year
2023
Pagerank
4.4715132e-05
Overall Rank
8,670 | 39.69%
DOI
10.14778/3587136.3587147

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
10,582 A Flexible Framework for Query-oriented Interactive Community Search 2025 VLDB 4.1945683e-05
10,716 Federated and Balanced Clustering for High-dimensional Data 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,241 Multi-dimensional Selectivity Estimation Using Compressed Histogram Information 1999 SIGMOD 0.00013097578
2,093 Scalable K-Means++ 2012 VLDB 9.5588104e-05
4,652 On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection 2021 VLDB 6.0228549e-05
Previous Page 1 / 1 Next

Semantically Similar Papers