Database Paper Browser


Efficient Algorithms for Mining Outliers from Large Data Sets

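Summary: Outliers are defined by the distance to their kth nearest neighbor (kth-NN): points are ranked by this distance and the top n are reported as outliers. A partition-based algorithm prunes whole partitions that cannot contain outliers, discarding non-outliers early and outperforming the nested-loop and index-join algorithms. Experiments on NBA and synthetic data show that it scales with data set size and dimensionality. (summarized by gpt-5-nano on Feb 09 2026)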
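The kth-NN ranking in the summary can be illustrated with a minimal sketch. This is only the brute-force nested-loop formulation, not the partition-based algorithm from the paper. The function names (`kth_nn_distance`, `top_n_outliers`), the Euclidean metric, and the sample points are assumptions made for illustration.

```python
import heapq
import math


def kth_nn_distance(points, i, k):
    """Distance from points[i] to its k-th nearest neighbor, excluding itself."""
    dists = [math.dist(points[i], q) for j, q in enumerate(points) if j != i]
    # nsmallest returns the k smallest distances in ascending order,
    # so the last element is the k-th nearest neighbor distance.
    return heapq.nsmallest(k, dists)[-1]


def top_n_outliers(points, k, n):
    """Return the n (index, score) pairs with the largest k-th NN distance."""
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < number of points")
    # Nested loop: every point is compared against every other point, O(N^2).
    scores = [(i, kth_nn_distance(points, i, k)) for i in range(len(points))]
    return heapq.nlargest(n, scores, key=lambda pair: pair[1])


# Hypothetical data: four clustered points and one far-away point.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (10.0, 10.0)]
result = top_n_outliers(points, k=2, n=1)
print(result)  # the far-away point (index 4) ranks first
```

The quadratic cost of this brute-force version is the reason pruning matters: discarding regions that cannot contain a top-n outlier avoids most of these distance computations.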

Paper ID: 3204
Venue: SIGMOD
Year: 2000
Pagerank: 0.00017938417
Overall Rank: 701 | 95.13%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 25 of 25 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
161 | LOF: Identifying Density-Based Local Outliers | 2000 | SIGMOD | 0.00039846974
1,253 | Anomaly Detection in Time Series: A Comprehensive Evaluation | 2022 | VLDB | 0.00013032074
1,854 | Distance-based Outlier Detection in Data Streams | 2016 | VLDB | 0.00010317762
2,629 | Online Outlier Detection in Sensor Data Using Non-Parametric Models | 2006 | VLDB | 8.4160309e-05
3,171 | Interactive Outlier Exploration in Big Data Streams | 2014 | VLDB | 7.4447236e-05
4,456 | AutoOD: Automatic Outlier Detection | 2023 | SIGMOD | 6.1704203e-05
4,552 | Outlier Detection for High Dimensional Data | 2001 | SIGMOD | 6.0922282e-05
4,554 | A Demonstration of AutoOD: A Self-Tuning Anomaly Detection System | 2022 | VLDB | 6.0911296e-05
4,584 | Scalable Kernel Density Classification via Threshold-Based Pruning | 2017 | SIGMOD | 6.0668364e-05
6,423 | AutoTSAD: Unsupervised Holistic Anomaly Detection for Time Series Data | 2024 | VLDB | 5.0670573e-05
6,991 | Sharing-Aware Outlier Analytics over High-Volume Data Streams | 2016 | SIGMOD | 4.8702811e-05
7,371 | Benchmarking the Utility of w-event Differential Privacy Mechanisms - When Baselines Become Mighty Competitors | 2023 | VLDB | 4.7497236e-05
7,575 | Human-in-the-loop Outlier Detection | 2020 | SIGMOD | 4.7068909e-05
8,228 | Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data | 2007 | VLDB | 4.5549459e-05
8,868 | A Bayesian Method for Guessing the Extreme Values in a Data Set | 2007 | VLDB | 4.4320869e-05
9,221 | VisClean: Interactive Cleaning for Progressive Visualization | 2020 | VLDB | 4.3699444e-05
9,420 | Local Search Methods for k-Means with Outliers | 2017 | VLDB | 4.3441378e-05
9,599 | SPARTAN: Data-Adaptive Symbolic Time-Series Approximation | 2025 | SIGMOD | 4.3177432e-05
9,709 | Outlier Summarization via Human Interpretable Rules | 2024 | VLDB | 4.299267e-05
9,787 | Distance-Based Outlier Detection: Consolidation and Renewed Bearing | 2010 | VLDB | 4.2823546e-05
10,466 | A Structured Study of Multivariate Time-Series Distance Measures | 2025 | SIGMOD | 4.1945683e-05
10,637 | TAB: Unified Benchmarking of Time Series Anomaly Detection Methods | 2025 | VLDB | 4.1945683e-05
10,738 | TSB-AutoAD: Towards Automated Solutions for Time-Series Anomaly Detection | 2025 | VLDB | 4.1945683e-05
11,500 | Comprehensible Counterfactual Explanation on Kolmogorov-Smirnov Test | 2021 | VLDB | 4.1945683e-05
12,040 | Interactive Data Mining with 3D-Parallel-Coordinate-Trees | 2013 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.


Semantically Similar Papers