Database Paper Browser

ICARUS: Minimizing Human Effort in Iterative Data Completion

Summary: ICARUS reduces expert labor by presenting small, high-impact subsets of the matrix for the user to edit. Schema-informed hierarchies amplify edits into rules; heuristic subset selection yields ~50% improvement, with users filling 68% in an hour. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 11737
Venue: VLDB
Year: 2018
Pagerank: 4.6564959e-05
Overall Rank: 7,766 | 45.98%
DOI: 10.14778/3275366.3275374

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
5,347 | Adaptive Rule Discovery for Labeling Text Data | 2021 | SIGMOD | 5.5560452e-05

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858
224 | CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies | 2004 | SIGMOD | 0.00032746205
623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374
656 | ERACER: A Database Approach for Statistical Inference and Data Cleaning | 2010 | SIGMOD | 0.00018588729
732 | Discovering Data Quality Rules | 2008 | VLDB | 0.00017465093
833 | Guided Data Repair | 2011 | VLDB | 0.00016138432
1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733
1,188 | On Generating Near-Optimal Tableaux for Conditional Functional Dependencies | 2008 | VLDB | 0.00013441729
1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851
2,184 | A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data | 2014 | SIGMOD | 9.3429789e-05
2,506 | Auto-Detect: Data-Driven Error Detection in Tables | 2018 | SIGMOD | 8.6335464e-05
2,797 | Query-Oriented Data Cleaning with Oracles | 2015 | SIGMOD | 8.1108589e-05
3,067 | CrowdFill: Collecting Structured Data from the Crowd | 2014 | SIGMOD | 7.6180371e-05
3,230 | Learning Semantic String Transformations from Examples | 2012 | VLDB | 7.339123e-05
3,913 | Rudolf: Interactive Rule Refinement System for Fraud Detection | 2016 | VLDB | 6.6346244e-05
3,976 | UGuide – User-Guided Discovery of FD-Detectable Errors | 2017 | SIGMOD | 6.5736462e-05
6,444 | Evaluating Interactive Data Systems: Workloads, Metrics, and Guidelines | 2018 | SIGMOD | 5.059132e-05
8,475 | DataProf: Semantic Profiling for Iterative Data Cleansing and Business Rule Acquisition | 2018 | SIGMOD | 4.5028904e-05

Semantically Similar Papers