Database Paper Browser

Back to papers

Statistical Distortion: Consequences of Data Cleaning

Summary: Introduces statistical distortion as a metric for data cleaning impact. A scalable experimental framework evaluates glitch improvement, statistical distortion, and cost, addressing gaps in existing metrics; demonstrated on real-world data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10401
Venue
VLDB
Year
2012
Pagerank
9.7764643e-05
Overall Rank
2,018 | 85.97%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 11 of 11 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
656 ERACER: A Database Approach for Statistical Inference and Data Cleaning 2010 SIGMOD 0.00018588729
2,686 Online Data Fusion 2011 VLDB 8.3053595e-05
3,713 GDR: A System for Guided Data Repair 2010 SIGMOD 6.8224341e-05
Previous Page 1 / 1 Next

Semantically Similar Papers