Scorpion: Explaining Away Outliers in Aggregate Queries
Summary: Scorpion takes user-specified outlier points in aggregate query results and derives predicates over the input data that remove those outliers. It defines the influence of a predicate and searches for maximum-influence predicates efficiently, finding explanations faster than a naive search. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Eugene Wu
2. Samuel Madden
Incoming Citations (Sorted by Pagerank)
Showing 50 of 82 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 277 | Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications | 1998 | SIGMOD | 0.00029311426 |
| 767 | Explaining differences in multidimensional aggregates | 1999 | VLDB | 0.00016981309 |
| 1,000 | Intelligent Rollups in Multidimensional OLAP Data | 2001 | VLDB | 0.00014709252 |
| 1,534 | PerfXplain: Debugging MapReduce Job Performance | 2012 | VLDB | 0.00011468393 |
| 1,699 | Sensitivity Analysis and Explanations for Robust Query Evaluation in Probabilistic Databases | 2011 | SIGMOD | 0.00010858983 |
| 1,970 | Approximate Lineage for Probabilistic Databases | 2008 | VLDB | 9.896375e-05 |
| 2,602 | Tracing Data Errors with View-Conditioned Causality | 2011 | SIGMOD | 8.4667197e-05 |
| 2,852 | MRI: Meaningful Interpretations of Collaborative Ratings | 2011 | VLDB | 8.0151391e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,556 | Interactive Query Explanations Using Fine Grained Provenance | 2022 | SIGMOD | 4.7117814e-05 |
| 701 | Efficient Algorithms for Mining Outliers from Large Data Sets | 2000 | SIGMOD | 0.00017938417 |
| 7,314 | Efficient Evaluation of Object-Centric Exploration Queries for Visualization | 2015 | VLDB | 4.7648346e-05 |
| 6,991 | Sharing-Aware Outlier Analytics over High-Volume Data Streams | 2016 | SIGMOD | 4.8702811e-05 |
| 774 | Algorithms for Mining Distance-Based Outliers in Large Datasets | 1998 | VLDB | 0.00016865771 |
| 5,128 | CAPE: Explaining Outliers by Counterbalancing | 2019 | VLDB | 5.6758584e-05 |
| 10,003 | Clustering with Set Outliers and Applications in Relational Clustering | 2026 | PODS | 4.1945683e-05 |
| 2,649 | Explaining Query Answers with Explanation-Ready Databases | 2016 | VLDB | 8.3719123e-05 |
| 7,351 | Distributed Outlier Detection using Compressive Sensing | 2015 | SIGMOD | 4.7545562e-05 |
| 5,191 | Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances | 2019 | SIGMOD | 5.6378768e-05 |