WiClean: A System for Fixing Wikipedia Interlinks Using Revision History Patterns

Summary: WiClean plugs into Wikipedia, fixing interlink errors by mining revision-history patterns to signal incomplete links and propose corrections. Demonstrates entity coverage and interactive repair with VLDB'19 editors, showing practical effectiveness. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 11883
Venue: VLDB
Year: 2019
Pagerank: 4.1905499e-05
Overall Rank: 11,685 | 18.79%
DOI: 10.14778/3352063.3352081

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
556	Discovering Denial Constraints	2013	VLDB	0.00020214701
1,197	The LLUNATIC Data-Cleaning Framework	2013	VLDB	0.00013373177
1,544	KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing	2015	SIGMOD	0.00011438274
2,805	Query-Oriented Data Cleaning with Oracles	2015	SIGMOD	8.103731e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
7,868	Learning Over Dirty Data Without Cleaning	2020	SIGMOD	4.6276013e-05
5,454	QFix: Diagnosing Errors through Query Histories	2017	SIGMOD	5.4967096e-05
5,420	Mining an "Anti-Knowledge Base" from Wikipedia Updates with Applications to Fact Checking and Beyond	2020	VLDB	5.5154502e-05
10,730	UniClean: A Scalable Data Cleaning Solution for Mixed Errors based on Unified Cleaners and Optimized Cleaning Workflow	2025	VLDB	4.1905499e-05
5,930	ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning	2016	SIGMOD	5.2632185e-05
3,198	Towards Dependable Data Repairing with Fixing Rules	2014	SIGMOD	7.4029546e-05
1,160	Towards Certain Fixes with Editing Rules and Master Data	2010	VLDB	0.0001358129
11,687	IHCS: An Integrated Hybrid Cleaning System	2019	VLDB	4.1905499e-05
7,565	PIClean: A Probabilistic and Interactive Data Cleaning System	2019	SIGMOD	4.7048523e-05
9,224	VisClean: Interactive Cleaning for Progressive Visualization	2020	VLDB	4.3657563e-05