Database Paper Browser


Complaint-driven Training Data Debugging for Query 2.0

Summary: Rain is a complaint-driven debugger for training data in Query 2.0, where SQL queries embed model inference. Users specify complaints about a query's output, and Rain returns a minimal set of training examples whose removal would resolve them. Two influence-function-based heuristics need only a linear number of retraining steps, achieving high recall@k and interactive performance on real-world datasets. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 5911
Venue: SIGMOD
Year: 2020
Pagerank: 8.1724339e-05
Overall Rank: 2,753 | 80.85%
DOI: 10.1145/3318464.3389696

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 15 of 15 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 35 of 35 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
31 Provenance Semirings 2007 PODS 0.0007857786
74 Efficient Query Evaluation on Probabilistic Databases 2004 VLDB 0.00057857292
140 The MADlib Analytics Library or MAD Skills, the SQL 2012 VLDB 0.00042270404
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
214 Scorpion: Explaining Away Outliers in Aggregate Queries 2013 VLDB 0.0003363692
329 Accelerating Machine Learning Inference with Probabilistic Predicates 2018 SIGMOD 0.00027249545
487 Why Not? 2009 SIGMOD 0.00022050218
543 MLbase: A Distributed Machine-learning System 2013 CIDR 0.00020526854
557 SystemML: Declarative Machine Learning on Spark 2016 VLDB 0.00020197988
712 Magellan: Toward Building Entity Matching Management Systems 2016 VLDB 0.00017732426
791 ActiveClean: Interactive Data Cleaning For Statistical Modeling 2016 VLDB 0.00016629664
942 A Formal Approach to Finding Explanations for Database Queries 2014 SIGMOD 0.00015155714
1,041 Interventional Fairness: Causal Database Repair for Algorithmic Fairness 2019 SIGMOD 0.00014482047
1,106 Provenance for Aggregate Queries 2011 PODS 0.0001398766
1,125 How to ConQueR Why-Not Questions 2010 SIGMOD 0.00013845652
1,337 HoloDetect: Few-Shot Learning for Error Detection 2019 SIGMOD 0.00012497164
1,371 Tiresias: The Database Oracle for How-To Queries 2012 SIGMOD 0.00012323502
1,420 Data Management Challenges in Production Machine Learning 2017 SIGMOD 0.00012057956
1,612 Detecting Data Errors: Where are we and what needs to be done? 2016 VLDB 0.00011142794
1,699 Sensitivity Analysis and Explanations for Robust Query Evaluation in Probabilistic Databases 2011 SIGMOD 0.00010858983
2,154 DIFF: A Relational Interface for Large-Scale Data Explanation 2019 VLDB 9.4208667e-05
2,402 Causality and Explanations in Databases 2014 VLDB 8.8928361e-05
2,649 Explaining Query Answers with Explanation-Ready Databases 2016 VLDB 8.3719123e-05
2,968 Raha: A Configuration-Free Error Detection System 2019 SIGMOD 7.7985097e-05
3,105 Data X-Ray: A Diagnostic Tool for Data Errors 2015 SIGMOD 7.5568954e-05
3,218 Reverse Data Management 2011 VLDB 7.3592173e-05
3,663 Reverse Engineering Aggregation Queries 2017 VLDB 6.8647221e-05
3,773 Cleaning Crowdsourced Labels Using Oracles for Statistical Classification 2019 VLDB 6.7758649e-05
3,875 Cloudy with High Chance of DBMS: A 10-year Prediction for Enterprise-Grade ML 2020 CIDR 6.675257e-05
4,087 Snorkel: Fast Training Set Generation for Information Extraction 2017 SIGMOD 6.4607746e-05
4,196 Overton: A Data System for Monitoring and Improving Machine-Learned Products 2020 CIDR 6.3686231e-05
5,445 QFix: Diagnosing Errors through Query Histories 2017 SIGMOD 5.5020909e-05
6,373 DeepBase: Deep Inspection of Neural Networks 2019 SIGMOD 5.0929326e-05
7,262 HypDB: A Demonstration of Detecting, Explaining and Resolving Bias in OLAP queries 2018 VLDB 4.78584e-05
7,820 Subjective Databases 2019 VLDB 4.6431208e-05

Semantically Similar Papers