Database Paper Browser

Back to papers

A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration

Summary: Bayesian model for truth finding in data integration; infers true records and source quality, FP/FN modeling for multivalued attributes. Scalable, linear time, sampling-based inference with an incremental variant, outperforming prior methods on real data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10500
Venue
VLDB
Year
2012
Pagerank
0.00025389696
Overall Rank
371 | 97.43%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 26 of 26 citing papers.

Rank Citing Paper Year Venue Pagerank
254 Snorkel: Rapid Training Data Creation with Weak Supervision 2018 VLDB 0.00030540555
908 Fusing Data with Correlations 2014 SIGMOD 0.00015431241
1,211 Truth Finding on the Deep Web: Is the Problem Solved? 2013 VLDB 0.00013257101
2,420 From Data Fusion to Knowledge Fusion 2014 VLDB 8.8530994e-05
2,567 Resolving Conflicts in Heterogeneous Data by Truth Discovery and Source Reliability Estimation 2014 SIGMOD 8.5239306e-05
3,105 Data X-Ray: A Diagnostic Tool for Data Errors 2015 SIGMOD 7.5568954e-05
3,263 QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications 2015 SIGMOD 7.3097573e-05
3,495 Knowledge-Based Trust: Estimating the Trustworthiness of Web Sources 2015 VLDB 7.0400666e-05
3,897 SLiMFast: Guaranteed Results for Data Fusion and Source Reliability 2017 SIGMOD 6.6554845e-05
4,011 A Confidence-Aware Approach for Truth Discovery on Long-Tail Data 2015 VLDB 6.5343479e-05
4,101 Less is More: Selecting Sources Wisely for Integration 2013 VLDB 6.4523909e-05
4,904 Temporal Rules Discovery for Web Data Cleaning 2016 VLDB 5.8399195e-05
5,251 Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale 2019 SIGMOD 5.6029615e-05
5,405 Truth Discovery and Crowdsourcing Aggregation: A Unified Perspective 2015 VLDB 5.5257718e-05
5,445 QFix: Diagnosing Errors through Query Histories 2017 SIGMOD 5.5020909e-05
6,354 Characterizing and Selecting Fresh Data Sources 2014 SIGMOD 5.0990729e-05
6,557 Knowledge Verification for Long-Tail Verticals 2017 VLDB 5.0124455e-05
6,780 Domain-Aware Multi-Truth Discovery from Conflicting Sources 2018 VLDB 4.9277708e-05
7,648 User Guidance for Efficient Fact Checking 2019 VLDB 4.6889787e-05
7,784 Authenticated Online Data Integration Services 2015 SIGMOD 4.6517065e-05
8,086 Determining the Relative Accuracy of Attributes 2013 SIGMOD 4.5899469e-05
8,362 Minimizing Efforts in Validating Crowd Answers 2015 SIGMOD 4.5366717e-05
10,965 High Precision ≠ High Cost: Temporal Data Fusion for Multiple Low-Precision Sensors 2024 SIGMOD 4.1945683e-05
11,006 FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data 2024 VLDB 4.1945683e-05
11,799 Truth Discovery for Spatio-Temporal Events from Crowdsourced Data 2017 VLDB 4.1945683e-05
12,149 Mining Knowledge from Interconnected Data: A Heterogeneous Information Network Analysis Approach 2012 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
49 Consistent Query Answers in Inconsistent Databases 1999 PODS 0.00067660624
855 Integrating Conflicting Data: The Role of Source Dependence 2009 VLDB 0.00015906735
1,246 Truth Discovery and Copying Detection in a Dynamic World 2009 VLDB 0.0001307161
1,289 Using Probabilistic Information in Data Integration 1997 VLDB 0.00012804879
Previous Page 1 / 1 Next

Semantically Similar Papers