Database Paper Browser

Back to papers

BIRCH: An Efficient Data Clustering Method for Very Large Databases

Summary: BIRCH offers incremental, memory-efficient clustering for very large databases; first DB clustering method to effectively handle noise. Shows strong time/space efficiency and single-scan quality; outperforms CLARANS on large datasets. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
2875
Venue
SIGMOD
Year
1996
Pagerank
0.00077324389
Overall Rank
33 | 99.78%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 34 of 84 citing papers.

Rank Citing Paper Year Venue Pagerank
6,544 A Framework for Measuring Changes in Data Characteristics 1999 PODS 5.0202405e-05
6,653 Supporting Ranking and Clustering as Generalized Order-By and Group-By 2007 SIGMOD 4.9735307e-05
6,799 Incremental and Effective Data Summarization for Dynamic Hierarchical Clustering 2004 SIGMOD 4.9232394e-05
6,883 C2P: Clustering based on Closest Pairs 2001 VLDB 4.8960306e-05
6,894 TableDC: Deep Clustering for Tabular Data 2025 SIGMOD 4.8925595e-05
7,419 A Shared Execution Strategy for Multiple Pattern Mining Requests over Streaming Data 2009 VLDB 4.7348504e-05
7,488 Data Stream Clustering: An In-depth Empirical Study 2023 SIGMOD 4.7180617e-05
7,608 Clustering Objects on a Spatial Network 2004 SIGMOD 4.6967024e-05
8,466 Building Statistical Models and Scoring with UDFs 2007 SIGMOD 4.5050696e-05
8,779 Summarization and Matching of Density-Based Clusters in Streaming Environments 2012 VLDB 4.4539178e-05
8,789 Machine Learning Meets Big Spatial Data 2019 VLDB 4.4509194e-05
8,838 Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering 2001 SIGMOD 4.438882e-05
8,979 High Performance Stream Query Processing With Correlation-Aware Partitioning 2014 VLDB 4.4170433e-05
9,067 MAIDS: Mining Alarming Incidents from Data Streams 2004 SIGMOD 4.4034035e-05
9,068 A Framework for Projected Clustering of High Dimensional Data Streams 2004 VLDB 4.4034035e-05
9,573 DataLens: Making a Good First Impression 2009 SIGMOD 4.3254101e-05
9,657 CloudVista: Interactive and Economical Visual Cluster Analysis for Big Data in the Cloud 2012 VLDB 4.3109001e-05
9,787 Distance-Based Outlier Detection: Consolidation and Renewed Bearing 2010 VLDB 4.2823546e-05
10,309 CLaP - State Detection from Time Series 2026 VLDB 4.1945683e-05
10,557 Explaining Black-Box Clustering Pipelines With Cluster-Explorer 2025 VLDB 4.1945683e-05
10,624 Evaluating Methods for Efficient Entity Count Estimation 2025 VLDB 4.1945683e-05
10,718 BURST: Rendering Clustering Techniques Suitable for Evolving Streams 2025 VLDB 4.1945683e-05
10,945 Efficient High-Quality Clustering for Large Bipartite Graphs 2024 SIGMOD 4.1945683e-05
10,971 Settling Time vs. Accuracy Tradeoffs for Clustering Big Data 2024 SIGMOD 4.1945683e-05
11,942 AIDE: An Automatic User Navigation System for Interactive Data Exploration 2015 VLDB 4.1945683e-05
12,479 On Dominating Your Neighborhood Profitably 2007 VLDB 4.1945683e-05
12,571 k-Means Projective Clustering 2004 PODS 4.1945683e-05
12,573 Cost-Based Labeling of Groups of Mass Spectra 2004 SIGMOD 4.1945683e-05
12,613 Capacity Bound-free Web Warehouse 2003 CIDR 4.1945683e-05
12,622 A Shrinking-Based Approach for Multi-Dimensional Data Analysis 2003 VLDB 4.1945683e-05
12,623 Data Bubbles for Non-Vector Data: Speeding-up Hierarchical Clustering in Arbitrary Metric Spaces 2003 VLDB 4.1945683e-05
12,698 INSITE: A Tool for Real-Time Knowledge Discovery from Users Web Navigation 2000 VLDB 4.1945683e-05
12,766 DEVise: Integrated Querying and Visual Exploration of Large Datasets (DEMO ABSTRACT) 1997 SIGMOD 4.1945683e-05
12,769 GeoMiner: A System Prototype for Spatial Data Mining 1997 SIGMOD 4.1945683e-05
Previous Page 2 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 1 of 1 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
27 Efficient and Effective Clustering Methods for Spatial Data Mining 1994 VLDB 0.00080736878
Previous Page 1 / 1 Next

Semantically Similar Papers