Database Paper Browser

Back to papers

Truth Finding on the Deep Web: Is the Problem Solved?

Summary: Examines truthfulness of Deep Web data in stock and flight domains, finding pervasive inconsistency and varied accuracy across sources. Applies state-of-the-art data fusion to resolve conflicts, analyzes strengths/limits, and sketches directions for future research. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10757
Venue
VLDB
Year
2013
Pagerank
0.00013257101
Overall Rank
1,211 | 91.58%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 37 of 37 citing papers.

Rank Citing Paper Year Venue Pagerank
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
908 Fusing Data with Correlations 2014 SIGMOD 0.00015431241
1,894 Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning 2020 VLDB 0.0001018378
2,359 Data Market Platforms: Trading Data Assets to Solve Data Problems 2020 VLDB 8.9607667e-05
2,420 From Data Fusion to Knowledge Fusion 2014 VLDB 8.8530994e-05
2,567 Resolving Conflicts in Heterogeneous Data by Truth Discovery and Source Reliability Estimation 2014 SIGMOD 8.5239306e-05
2,617 Extraction and Integration of Partially Overlapping Web Sources 2013 VLDB 8.4462621e-05
3,133 Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing 2017 VLDB 7.4978041e-05
3,263 QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications 2015 SIGMOD 7.3097573e-05
3,495 Knowledge-Based Trust: Estimating the Trustworthiness of Web Sources 2015 VLDB 7.0400666e-05
3,897 SLiMFast: Guaranteed Results for Data Fusion and Source Reliability 2017 SIGMOD 6.6554845e-05
4,011 A Confidence-Aware Approach for Truth Discovery on Long-Tail Data 2015 VLDB 6.5343479e-05
4,101 Less is More: Selecting Sources Wisely for Integration 2013 VLDB 6.4523909e-05
4,607 Data Integration and Machine Learning: A Natural Synergy 2018 SIGMOD 6.0538827e-05
4,904 Temporal Rules Discovery for Web Data Cleaning 2016 VLDB 5.8399195e-05
5,002 Sequential Data Cleaning: A Statistical Approach 2016 SIGMOD 5.7671075e-05
6,354 Characterizing and Selecting Fresh Data Sources 2014 SIGMOD 5.0990729e-05
6,451 Multivariate Time Series Cleaning under Speed Constraints 2024 SIGMOD 5.0583324e-05
6,557 Knowledge Verification for Long-Tail Verticals 2017 VLDB 5.0124455e-05
6,583 SCREEN: Stream Data Cleaning under Speed Constraints 2015 SIGMOD 5.0027988e-05
6,780 Domain-Aware Multi-Truth Discovery from Conflicting Sources 2018 VLDB 4.9277708e-05
6,941 Estimating the Impact of Unknown Unknowns on Aggregate Query Results 2016 SIGMOD 4.8924e-05
7,223 Akane: Perplexity-Guided Time Series Data Cleaning 2024 SIGMOD 4.7965857e-05
7,243 Data Integration and Machine Learning: A Natural Synergy 2018 VLDB 4.7913666e-05
7,784 Authenticated Online Data Integration Services 2015 SIGMOD 4.6517065e-05
7,919 DEXTER: Large-Scale Discovery and Extraction of Product Specifications on the Web 2015 VLDB 4.616746e-05
8,149 Why Not Match: On Explanations of Event Pattern Queries 2021 SIGMOD 4.5752863e-05
8,840 The Cost of Representation by Subset Repairs 2025 VLDB 4.4388652e-05
8,849 SourceSight: Enabling Effective Source Selection 2016 SIGMOD 4.4369118e-05
9,348 GIDCL: A Graph-Enhanced Interpretable Data Cleaning Framework with Large Language Models 2024 SIGMOD 4.3526427e-05
9,924 On Saving Outliers for Better Clustering over Noisy Data 2021 SIGMOD 4.2544238e-05
10,510 Table Overlap Estimation through Graph Embeddings 2025 SIGMOD 4.1945683e-05
10,511 The Best of Both Worlds: On Repairing Timestamps and Attribute Values for Multivariate Time Series 2025 SIGMOD 4.1945683e-05
10,951 Determining the Largest Overlap between Tables 2024 SIGMOD 4.1945683e-05
11,006 FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data 2024 VLDB 4.1945683e-05
11,770 Staging User Feedback toward Rapid Conflict Resolution in Data Fusion 2017 SIGMOD 4.1945683e-05
11,895 Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration 2015 CIDR 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers