Database Paper Browser

Back to papers

Scorpion: Explaining Away Outliers in Aggregate Queries

Summary: Scorpion takes user-specified outlier points from aggregate results and derives input predicates that remove those outliers. Defines predicate influence and uses efficient max-influence search to reveal explanations faster than naive search. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10719
Venue
VLDB
Year
2013
Pagerank
0.0003363692
Overall Rank
214 | 98.52%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 82 citing papers.

Rank Citing Paper Year Venue Pagerank
460 SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics 2015 VLDB 0.00022516069
942 A Formal Approach to Finding Explanations for Database Queries 2014 SIGMOD 0.00015155714
1,022 DBSherlock: A Performance Diagnostic Tool for Transactional Databases 2016 SIGMOD 0.00014614917
1,099 Interpretable and Informative Explanations of Outcomes 2015 VLDB 0.00014096312
1,337 HoloDetect: Few-Shot Learning for Error Detection 2019 SIGMOD 0.00012497164
1,612 Detecting Data Errors: Where are we and what needs to be done? 2016 VLDB 0.00011142794
1,627 Data Cleaning: Overview and Emerging Challenges 2016 SIGMOD 0.00011086905
1,634 Exathlon: A Benchmark for Explainable Anomaly Detection over Time Series 2021 VLDB 0.00011058945
2,104 Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets 2016 SIGMOD 9.536298e-05
2,126 MacroBase: Prioritizing Attention in Fast Data 2017 SIGMOD 9.4887794e-05
2,154 DIFF: A Relational Interface for Large-Scale Data Explanation 2019 VLDB 9.4208667e-05
2,280 SMOKE: Fine-grained Lineage at Interactive Speed 2018 VLDB 9.1111033e-05
2,402 Causality and Explanations in Databases 2014 VLDB 8.8928361e-05
2,498 Support the Data Enthusiast: Challenges for Next-Generation Data-Analysis Systems 2014 VLDB 8.6465331e-05
2,649 Explaining Query Answers with Explanation-Ready Databases 2016 VLDB 8.3719123e-05
2,733 The Case for Data Visualization Management Systems [Vision Paper] 2014 VLDB 8.2078862e-05
2,753 Complaint-driven Training Data Debugging for Query 2.0 2020 SIGMOD 8.1724339e-05
3,105 Data X-Ray: A Diagnostic Tool for Data Errors 2015 SIGMOD 7.5568954e-05
3,299 SCODED: Statistical Constraint Oriented Data Error Detection 2020 SIGMOD 7.2546659e-05
3,319 Sketching Linear Classifiers over Data Streams 2018 SIGMOD 7.226439e-05
3,340 Toward Computational Fact-Checking 2014 VLDB 7.2030091e-05
3,393 Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows 2022 VLDB 7.1483239e-05
4,361 The Complexity of Resilience and Responsibility for Self-Join-Free Conjunctive Queries 2016 VLDB 6.2559141e-05
4,937 New Results for the Complexity of Resilience for Binary Conjunctive Queries with Self-Joins 2020 PODS 5.8187108e-05
5,128 CAPE: Explaining Outliers by Counterbalancing 2019 VLDB 5.6758584e-05
5,191 Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances 2019 SIGMOD 5.6378768e-05
5,192 Pattern Functional Dependencies for Data Cleaning 2020 VLDB 5.6375087e-05
5,222 Enabling SQL-based Training Data Debugging for Federated Learning 2022 VLDB 5.6210545e-05
5,313 XInsight: eXplainable Data Analysis Through The Lens of Causality 2023 SIGMOD 5.573009e-05
5,445 QFix: Diagnosing Errors through Query Histories 2017 SIGMOD 5.5020909e-05
5,560 PI2: End-to-end Interactive Visualization Interface Generation from Queries 2022 SIGMOD 5.4336252e-05
5,660 Descriptive and Prescriptive Data Cleaning 2014 SIGMOD 5.3847321e-05
5,691 Putting Things into Context: Rich Explanations for Query Answers using Join Graphs 2021 SIGMOD 5.3684557e-05
5,733 Explaining Wrong Queries Using Small Examples 2019 SIGMOD 5.3483446e-05
5,867 Combining Design and Performance in a Data Visualization Management System 2017 CIDR 5.296418e-05
6,475 Explain3D: Explaining Disagreements in Disjoint Datasets 2019 VLDB 5.0497183e-05
6,565 Toward Interpretable and Actionable Data Analysis with Explanations and Causality 2022 VLDB 5.0081626e-05
6,696 Approximate Summaries for Why and Why-not Provenance 2020 VLDB 4.9581958e-05
6,779 Explaining Inference Queries with Bayesian Optimization 2021 VLDB 4.9280116e-05
6,842 Towards Democratizing Relational Data Visualization 2019 SIGMOD 4.9103931e-05
7,013 Qualitative Data Cleaning 2016 VLDB 4.8619024e-05
7,022 A Unified Approach for Resilience and Causal Responsibility with Integer Linear Programming (ILP) and LP Relaxations 2023 SIGMOD 4.8576599e-05
7,172 Summarized Causal Explanations For Aggregate Views 2024 SIGMOD 4.8114797e-05
7,364 ExplainED: Explanations for EDA Notebooks 2020 VLDB 4.7519211e-05
7,556 Interactive Query Explanations Using Fine Grained Provenance 2022 SIGMOD 4.7117814e-05
7,833 Dependency-Driven Analytics: a Compass for Uncharted Data Oceans 2017 CIDR 4.6382648e-05
8,388 FEDEX: An Explainability Framework for Data Exploration Steps 2022 VLDB 4.5297787e-05
8,721 Aggregated Deletion Propagation for Counting Conjunctive Query Answers 2021 VLDB 4.4608778e-05
8,728 Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views 2015 VLDB 4.4589711e-05
8,830 LensXPlain: Visualizing and Explaining Contributing Subsets for Aggregate Query Answers 2019 VLDB 4.4404336e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers