Database Paper Browser

Back to papers

Human-in-the-loop Outlier Detection

Summary: HOD combines machine outliers with human validation to reveal true outliers beyond statistical rarity. Context-inlier clustering aids interpretation; a bipartite-graph selector minimizes queries while covering all candidates, with empirical gains. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5987
Venue
SIGMOD
Year
2020
Pagerank
4.7068909e-05
Overall Rank
7,575 | 47.31%
DOI
10.1145/3318464.3389772

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 16 of 16 citing papers.

Rank Citing Paper Year Venue Pagerank
4,102 GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data 2023 SIGMOD 6.4522929e-05
4,456 AutoOD: Automatic Outlier Detection 2023 SIGMOD 6.1704203e-05
4,825 Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks 2021 SIGMOD 5.8946721e-05
5,371 LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning 2022 SIGMOD 5.5428776e-05
5,381 Selective Data Acquisition in the Wild for Model Charging 2022 VLDB 5.5399508e-05
5,963 Automatic Data Acquisition for Deep Learning 2021 VLDB 5.2526794e-05
7,179 Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning 2023 VLDB 4.8078895e-05
8,116 LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes 2024 VLDB 4.581507e-05
8,268 Learned Data-aware Image Representations of Line Charts for Similarity Search 2023 SIGMOD 4.5456668e-05
9,077 VerifAI: Verified Generative AI 2024 CIDR 4.4010762e-05
9,709 Outlier Summarization via Human Interpretable Rules 2024 VLDB 4.299267e-05
9,771 EasyDR: A Human-in-the-loop Error Detection and Repair Platform for Holistic Table Cleaning 2022 VLDB 4.2856106e-05
10,029 Outliers: The Good, the Bad and the Ugly 2026 SIGMOD 4.1945683e-05
10,216 The Case For Language Model Approximated LIKE Predicate 2026 SIGMOD 4.1945683e-05
10,289 LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning 2026 VLDB 4.1945683e-05
11,000 MisDetect: Iterative Mislabel Detection using Early Loss 2024 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 17 of 17 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers