Finding Generalized Projected Clusters in High Dimensional Spaces

Summary: Generalized projected clustering finds clusters in arbitrarily aligned subspaces, subspaces defined per cluster, not from original attributes. Uses extended feature vectors to scale to large databases; provides tunable time/space–accuracy tradeoffs. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 3175
Venue: SIGMOD
Year: 2000
Pagerank: 9.7586371e-05
Overall Rank: 2,023 | 85.95%
DOI: -

Incoming Non-self Citations Over Time

Authors

1. Charu C. Aggarwal
2. Philip S. Yu

Incoming Citations (Sorted by Pagerank)

Showing 20 of 20 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
2,099	What is the nearest neighbor in high dimensional spaces?	2000	VLDB	9.5429949e-05
3,381	A Monte Carlo Algorithm for Fast Projective Clustering	2002	SIGMOD	7.1538392e-05
4,125	Computing Clusters of Correlation Connected Objects	2004	SIGMOD	6.4248096e-05
4,357	triCluster: An Effective Algorithm for Mining Coherent Clusters in 3D Microarray Data	2005	SIGMOD	6.251058e-05
4,549	Outlier Detection for High Dimensional Data	2001	SIGMOD	6.0866685e-05
4,818	Clustering by Pattern Similarity in Large Data Sets	2002	SIGMOD	5.8955574e-05
5,061	Hierarchical Subspace Sampling: A Unified Framework for High Dimensional Data Reduction, Selectivity Estimation and Nearest Neighbor Search	2002	SIGMOD	5.717245e-05
5,078	Combi-Operator – Database Support for Data Mining Applications	2003	VLDB	5.708568e-05
5,768	Outlier-robust Clustering using Independent Components	2008	SIGMOD	5.3332593e-05
6,289	Putting Context into Schema Matching	2006	VLDB	5.1226008e-05
6,316	On the Effects of Dimensionality Reduction on High Dimensional Similarity Search	2001	PODS	5.1098143e-05
6,892	C2P: Clustering based on Closest Pairs	2001	VLDB	4.8889412e-05
7,835	CURLER: Finding and Visualizing Nonlinear Correlation Clusters	2005	SIGMOD	4.6355641e-05
8,226	Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data	2007	VLDB	4.5505782e-05
8,473	Evaluating Clustering in Subspace Projections of High Dimensional Data	2009	VLDB	4.4985288e-05
12,048	Interactive Data Mining with 3D-Parallel-Coordinate-Trees	2013	SIGMOD	4.1905499e-05
12,387	Constrained Locally Weighted Clustering	2008	VLDB	4.1905499e-05
12,417	Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering	2008	VLDB	4.1905499e-05
12,580	k-Means Projective Clustering	2004	PODS	4.1905499e-05
12,668	An Automated System for Web Portal Personalization	2002	VLDB	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 0 of 0 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
5,061	Hierarchical Subspace Sampling: A Unified Framework for High Dimensional Data Reduction, Selectivity Estimation and Nearest Neighbor Search	2002	SIGMOD	5.717245e-05
13,814	Design and Analysis of Subspace Clustering Algorithms and their Applicability	2002	VLDB	-
13,940	Clustering Methods for Large Databases: From the Past to the Future	1999	SIGMOD	-
1,802	Local Dimensionality Reduction: A New Approach to Indexing High Dimensional Spaces	2000	VLDB	0.0001048177
2,099	What is the nearest neighbor in high dimensional spaces?	2000	VLDB	9.5429949e-05
9,067	A Framework for Projected Clustering of High Dimensional Data Streams	2004	VLDB	4.3991821e-05
12,417	Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering	2008	VLDB	4.1905499e-05
12,580	k-Means Projective Clustering	2004	PODS	4.1905499e-05
8,473	Evaluating Clustering in Subspace Projections of High Dimensional Data	2009	VLDB	4.4985288e-05
1,596	Fast Algorithms for Projected Clustering	1999	SIGMOD	0.00011210688