| 112 |
Potter's Wheel: An Interactive Data Cleaning System |
2001 |
VLDB |
0.00047045036 |
| 119 |
Answering Queries using Humans, Algorithms and Databases |
2011 |
CIDR |
0.0004564788 |
| 214 |
Scorpion: Explaining Away Outliers in Aggregate Queries |
2013 |
VLDB |
0.0003363692 |
| 263 |
CrowdER: Crowdsourcing Entity Resolution |
2012 |
VLDB |
0.00029862413 |
| 265 |
A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification |
2005 |
SIGMOD |
0.00029763412 |
| 280 |
Eliminating Fuzzy Duplicates in Data Warehouses |
2002 |
VLDB |
0.00029113044 |
| 477 |
Model-Driven Data Acquisition in Sensor Networks |
2004 |
VLDB |
0.00022221803 |
| 489 |
Data Curation at Scale: The Data Tamer System |
2013 |
CIDR |
0.00022030728 |
| 507 |
Data Quality and Data Cleaning: An Overview |
2003 |
SIGMOD |
0.00021473263 |
| 555 |
Discovering Denial Constraints |
2013 |
VLDB |
0.00020254908 |
| 623 |
Improving Data Quality: Consistency and Accuracy |
2007 |
VLDB |
0.00018996374 |
| 643 |
Corleone: Hands-Off Crowdsourcing for Entity Matching |
2014 |
SIGMOD |
0.00018754451 |
| 656 |
ERACER: A Database Approach for Statistical Inference and Data Cleaning |
2010 |
SIGMOD |
0.00018588729 |
| 833 |
Guided Data Repair |
2011 |
VLDB |
0.00016138432 |
| 866 |
Leveraging Transitive Relations for Crowdsourced Joins |
2013 |
SIGMOD |
0.00015801196 |
| 881 |
Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes |
2013 |
SIGMOD |
0.00015661103 |
| 1,012 |
NADEEF: A Commodity Data Cleaning System |
2013 |
SIGMOD |
0.0001464733 |
| 1,159 |
Towards Certain Fixes with Editing Rules and Master Data |
2010 |
VLDB |
0.00013592813 |
| 1,164 |
CrowdScreen: Algorithms for Filtering Data with Humans |
2012 |
SIGMOD |
0.00013564823 |
| 1,188 |
On Generating Near-Optimal Tableaux for Conditional Functional Dependencies |
2008 |
VLDB |
0.00013441729 |
| 1,197 |
The LLUNATIC Data-Cleaning Framework |
2013 |
VLDB |
0.00013390321 |
| 1,242 |
Question Selection for Crowd Entity Resolution |
2013 |
VLDB |
0.00013096655 |
| 1,546 |
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing |
2015 |
SIGMOD |
0.00011446851 |
| 1,594 |
Adaptive Cleaning for RFID Data Streams |
2006 |
VLDB |
0.00011222484 |
| 1,624 |
Sampling the Repairs of Functional Dependency Violations under Hard Constraints |
2010 |
VLDB |
0.00011099222 |
| 2,184 |
A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data |
2014 |
SIGMOD |
9.3429789e-05 |
| 2,231 |
Dedoop: Efficient Deduplication with Hadoop |
2012 |
VLDB |
9.2304499e-05 |
| 2,602 |
Tracing Data Errors with View-Conditioned Causality |
2011 |
SIGMOD |
8.4667197e-05 |
| 2,629 |
Online Outlier Detection in Sensor Data Using Non-Parametric Models |
2006 |
VLDB |
8.4160309e-05 |
| 2,722 |
Progressive Approach to Relational Entity Resolution |
2014 |
VLDB |
8.2338356e-05 |
| 2,797 |
Query-Oriented Data Cleaning with Oracles |
2015 |
SIGMOD |
8.1108589e-05 |
| 2,823 |
Interaction between Record Matching and Data Repairing |
2011 |
SIGMOD |
8.0593894e-05 |
| 2,946 |
BigDansing: A System for Big Data Cleansing |
2015 |
SIGMOD |
7.8372441e-05 |
| 3,067 |
CrowdFill: Collecting Structured Data from the Crowd |
2014 |
SIGMOD |
7.6180371e-05 |
| 3,118 |
Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning |
2015 |
VLDB |
7.5379338e-05 |
| 3,192 |
Towards Dependable Data Repairing with Fixing Rules |
2014 |
SIGMOD |
7.4095761e-05 |
| 3,360 |
Modeling and Querying Possible Repairs in Duplicate Detection |
2009 |
VLDB |
7.1742067e-05 |
| 3,920 |
Continuous Outlier Detection in Data Streams: An Extensible Framework and State-Of-The-Art Algorithms |
2013 |
SIGMOD |
6.6309693e-05 |
| 4,451 |
CLAMShell: Speeding up Crowds for Low-latency Data Labeling |
2016 |
VLDB |
6.1738675e-05 |
| 5,586 |
QuERy: A Framework for Integrating Entity Resolution with Query Processing |
2016 |
VLDB |
5.4219548e-05 |
| 5,660 |
Descriptive and Prescriptive Data Cleaning |
2014 |
SIGMOD |
5.3847321e-05 |
| 6,941 |
Estimating the Impact of Unknown Unknowns on Aggregate Query Results |
2016 |
SIGMOD |
4.8924e-05 |
| 8,148 |
When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms |
2013 |
VLDB |
4.5754467e-05 |
| 8,593 |
Wisteria: Nurturing Scalable Data Cleaning Infrastructure |
2015 |
VLDB |
4.4891474e-05 |
| 8,728 |
Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views |
2015 |
VLDB |
4.4589711e-05 |