Back to papers
Looking for Trouble: Analyzing Classifier Behavior via Pattern Divergence
Summary: Proposes divergence over itemsets to quantify classifier behavior gaps in data subgroups via pattern mining. Shapley-value attribution quantifies each feature's contribution to divergence, enabling detection of critical/peculiar subgroups for validation.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 6175
- Venue
- SIGMOD
- Year
- 2021
- Pagerank
- 7.4589576e-05
- Overall Rank
- 3,162 | 78.01%
- DOI
-
10.1145/3448016.3457284
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 6,779 |
Explaining Inference Queries with Bayesian Optimization |
2021 |
VLDB |
4.9280116e-05 |
| 9,644 |
Fair and Actionable Causal Prescription Ruleset |
2025 |
SIGMOD |
4.3109001e-05 |
| 10,147 |
Causal Explanations for Disparate Trends: Where and Why? |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,504 |
Subgroup Discovery with Small and Alternative Feature Sets |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,524 |
Understanding the Black Box: A Deep Empirical Dive into Shapley Value Approximations for Tabular Data |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,555 |
Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups |
2025 |
VLDB |
4.1945683e-05 |
| 10,617 |
Deduplicated Sampling On-Demand |
2025 |
VLDB |
4.1945683e-05 |
| 11,034 |
CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics |
2024 |
VLDB |
4.1945683e-05 |
| 11,121 |
Demonstration of VCR: A Tabular Data Slicing Approach to Understanding Object Detection Model Performance |
2024 |
VLDB |
4.1945683e-05 |
| 11,523 |
How Divergent Is Your Data? |
2021 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 6,067 |
IL-Miner: Instance-Level Discovery of Complex Event Patterns |
2017 |
VLDB |
5.2290408e-05 |
| 1,336 |
Clustering Categorical Data: An Approach Based on Dynamical Systems |
1998 |
VLDB |
0.00012498064 |
| 6,347 |
A Relational Framework for Classifier Engineering |
2017 |
PODS |
5.1019568e-05 |
| 10,555 |
Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups |
2025 |
VLDB |
4.1945683e-05 |
| 10,975 |
Language-Model Based Informed Partition of Databases to Speed Up Pattern Mining |
2024 |
SIGMOD |
4.1945683e-05 |
| 11,039 |
Efficient Discovery of Significant Patterns with Few-Shot Resampling |
2024 |
VLDB |
4.1945683e-05 |
| 12,408 |
Detecting Clusters in Moderate-to-High Dimensional Data: Subspace Clustering, Pattern-based Clustering, and Correlation Clustering |
2008 |
VLDB |
4.1945683e-05 |
| 6,544 |
A Framework for Measuring Changes in Data Characteristics |
1999 |
PODS |
5.0202405e-05 |
| 11,523 |
How Divergent Is Your Data? |
2021 |
VLDB |
4.1945683e-05 |
| 10,504 |
Subgroup Discovery with Small and Alternative Feature Sets |
2025 |
SIGMOD |
4.1945683e-05 |