Marigold: Efficient k-means Clustering in High Dimensions
Summary: Marigold accelerates k-means in high dimensions by aggressively pruning distance computations via a tight distance‑bounding scheme, stepwise multiresolution transforms, and triangle‑inequality exploitation. Novel combination yields near real‑time clustering (≈10× speedup on ARPES and other real-world datasets) without degrading k‑means accuracy. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,582 | A Flexible Framework for Query-oriented Interactive Community Search | 2025 | VLDB | 4.1945683e-05 |
| 10,716 | Federated and Balanced Clustering for High-dimensional Data | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,241 | Multi-dimensional Selectivity Estimation Using Compressed Histogram Information | 1999 | SIGMOD | 0.00013097578 |
| 2,093 | Scalable K-Means++ | 2012 | VLDB | 9.5588104e-05 |
| 4,652 | On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection | 2021 | VLDB | 6.0228549e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,971 | Settling Time vs. Accuracy Tradeoffs for Clustering Big Data | 2024 | SIGMOD | 4.1945683e-05 |
| 11,466 | Fast Density-Peaks Clustering: Multicore-based Parallelization Approach | 2021 | SIGMOD | 4.1945683e-05 |
| 12,622 | A Shrinking-Based Approach for Multi-Dimensional Data Analysis | 2003 | VLDB | 4.1945683e-05 |
| 2,019 | Finding Generalized Projected Clusters in High Dimensional Spaces | 2000 | SIGMOD | 9.7707059e-05 |
| 9,420 | Local Search Methods for k-Means with Outliers | 2017 | VLDB | 4.3441378e-05 |
| 8,168 | Evaluating Clustering in Subspace Projections of High Dimensional Data | 2009 | VLDB | 4.5701004e-05 |
| 2,093 | Scalable K-Means++ | 2012 | VLDB | 9.5588104e-05 |
| 4,652 | On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection | 2021 | VLDB | 6.0228549e-05 |
| 10,943 | Efficient Algorithm for K-Multiple-Means | 2024 | SIGMOD | 4.1945683e-05 |
| 12,571 | k-Means Projective Clustering | 2004 | PODS | 4.1945683e-05 |