Database Paper Browser

Back to papers

On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection

Summary: UniK unifies pruning-based accelerations for Lloyd's k-means into an evaluation framework with fine-grained performance breakdown. An optimized UniK-hybrid pruning strategy improves efficiency, with ML-based automatic selection of the best accelerator. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12350
Venue
VLDB
Year
2021
Pagerank
6.0228549e-05
Overall Rank
4,652 | 67.64%
DOI
10.14778/3425879.3425887

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Rank Citing Paper Year Venue Pagerank
8,670 Marigold: Efficient k-means Clustering in High Dimensions 2023 VLDB 4.4715132e-05
10,317 Highly-Efficient Large-Scale k-means with Individual Fairness 2026 VLDB 4.1945683e-05
10,716 Federated and Balanced Clustering for High-dimensional Data 2025 VLDB 4.1945683e-05
11,193 Prerequisite-driven Fair Clustering on Heterogeneous Information Networks 2023 SIGMOD 4.1945683e-05
11,219 F3 KM: Federated, Fair, and Fast k-means 2023 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
1,860 Approximation Algorithms for Clustering Uncertain Data 2008 PODS 0.0001028857
8,168 Evaluating Clustering in Subspace Projections of High Dimensional Data 2009 VLDB 4.5701004e-05
10,924 Improved Approximation Algorithms for Relational Clustering 2024 PODS 4.1945683e-05
12,571 k-Means Projective Clustering 2004 PODS 4.1945683e-05
10,317 Highly-Efficient Large-Scale k-means with Individual Fairness 2026 VLDB 4.1945683e-05
10,971 Settling Time vs. Accuracy Tradeoffs for Clustering Big Data 2024 SIGMOD 4.1945683e-05
11,045 Ensemble Clustering based on Meta-Learning and Hyperparameter Optimization 2024 VLDB 4.1945683e-05
10,943 Efficient Algorithm for K-Multiple-Means 2024 SIGMOD 4.1945683e-05
9,420 Local Search Methods for k-Means with Outliers 2017 VLDB 4.3441378e-05
2,093 Scalable K-Means++ 2012 VLDB 9.5588104e-05