RP-DBSCAN: A Superfast Parallel DBSCAN Algorithm Based on Random Partitioning
Summary: RP-DBSCAN uses pseudo random, cell-based partitioning to balance load in parallel DBSCAN on skewed data. Compact two-level cell dictionary enables local clustering with light cross-partition merge on Spark, yielding up to 180x speedup. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Hwanjun Song
- 2. Jae-Gil Lee
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,417 | Theoretically-Efficient and Practical Parallel DBSCAN | 2020 | SIGMOD | 5.5194222e-05 |
| 6,125 | DenForest: Enabling Fast Deletion in Incremental Density-Based Clustering over Sliding Windows | 2022 | SIGMOD | 5.1987868e-05 |
| 7,480 | Towards Metric DBSCAN: Exact, Approximate, and Streaming Algorithms | 2024 | SIGMOD | 4.7180617e-05 |
| 11,181 | Fast Density-Based Clustering: Geometric Approach | 2023 | SIGMOD | 4.1945683e-05 |
| 11,331 | The Gibbs–Rand Model | 2022 | PODS | 4.1945683e-05 |
| 11,466 | Fast Density-Peaks Clustering: Multicore-based Parallelization Approach | 2021 | SIGMOD | 4.1945683e-05 |
| 11,477 | Fast Parallel Algorithms for Euclidean Minimum Spanning Tree and Hierarchical Spatial Clustering* | 2021 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 270 | OPTICS: Ordering Points To Identify the Clustering Structure | 1999 | SIGMOD | 0.00029505642 |
| 961 | DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation | 2015 | SIGMOD | 0.00015001792 |
| 2,635 | NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data | 2017 | VLDB | 8.4045788e-05 |
| 3,264 | Dynamic Density Based Clustering | 2017 | SIGMOD | 7.3094408e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,993 | A Partitioning Framework for Aggressive Data Skipping | 2014 | VLDB | 4.1945683e-05 |
| 961 | DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation | 2015 | SIGMOD | 0.00015001792 |
| 3,264 | Dynamic Density Based Clustering | 2017 | SIGMOD | 7.3094408e-05 |
| 11,477 | Fast Parallel Algorithms for Euclidean Minimum Spanning Tree and Hierarchical Spatial Clustering* | 2021 | SIGMOD | 4.1945683e-05 |
| 2,635 | NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data | 2017 | VLDB | 8.4045788e-05 |
| 11,466 | Fast Density-Peaks Clustering: Multicore-based Parallelization Approach | 2021 | SIGMOD | 4.1945683e-05 |
| 7,480 | Towards Metric DBSCAN: Exact, Approximate, and Streaming Algorithms | 2024 | SIGMOD | 4.7180617e-05 |
| 10,470 | Approximate DBSCAN under Differential Privacy | 2025 | SIGMOD | 4.1945683e-05 |
| 11,181 | Fast Density-Based Clustering: Geometric Approach | 2023 | SIGMOD | 4.1945683e-05 |
| 5,417 | Theoretically-Efficient and Practical Parallel DBSCAN | 2020 | SIGMOD | 5.5194222e-05 |