Characterizing and Selecting Fresh Data Sources
Summary: Dynamic source selection for data integration; introduces time-dependent metrics (coverage, freshness, accuracy) and statistical models to estimate them. Despite NP-hardness, presents a near-optimal algorithmic framework with theoretical guarantees, validated on real and synthetic data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,897 | SLiMFast: Guaranteed Results for Data Fusion and Source Reliability | 2017 | SIGMOD | 6.6554845e-05 |
| 4,011 | A Confidence-Aware Approach for Truth Discovery on Long-Tail Data | 2015 | VLDB | 6.5343479e-05 |
| 7,345 | Linking Temporal Records for Profiling Entities | 2015 | SIGMOD | 4.756212e-05 |
| 7,784 | Authenticated Online Data Integration Services | 2015 | SIGMOD | 4.6517065e-05 |
| 8,849 | SourceSight: Enabling Effective Source Selection | 2016 | SIGMOD | 4.4369118e-05 |
| 11,895 | Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration | 2015 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 371 | A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration | 2012 | VLDB | 0.00025389696 |
| 489 | Data Curation at Scale: The Data Tamer System | 2013 | CIDR | 0.00022030728 |
| 1,211 | Truth Finding on the Deep Web: Is the Problem Solved? | 2013 | VLDB | 0.00013257101 |
| 1,246 | Truth Discovery and Copying Detection in a Dynamic World | 2009 | VLDB | 0.0001307161 |
| 4,101 | Less is More: Selecting Sources Wisely for Integration | 2013 | VLDB | 6.4523909e-05 |
Previous
Page 1 / 1
Next