Using Probabilistic Information in Data Integration
Summary: Proposes using probabilistic knowledge to guide query planning in mediator systems, modeling the overlap, coverage, and relationships of data sources. Offers a declarative formalism for expressing this knowledge and ranking algorithms that order sources so that results are returned early. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Daniela Florescu
2. Daphne Koller
3. Alon Levy
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 371 | A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration | 2012 | VLDB | 0.00025389696 |
| 394 | An Adaptive Query Execution System for Data Integration | 1999 | SIGMOD | 0.00024460855 |
| 721 | Data Integration with Uncertainty | 2007 | VLDB | 0.00017570539 |
| 1,606 | Enhanced hypertext categorization using hyperlinks | 1998 | SIGMOD | 0.00011174873 |
| 3,170 | Quality-driven Integration of Heterogeneous Information Systems | 1999 | VLDB | 7.4482367e-05 |
| 5,548 | Foundations of Uncertain-Data Integration | 2010 | VLDB | 5.4446854e-05 |
| 5,600 | BibFinder/StatMiner: Effectively Mining and Using Coverage and Overlap Statistics in Data Integration | 2003 | VLDB | 5.4160529e-05 |
| 6,941 | Estimating the Impact of Unknown Unknowns on Aggregate Query Results | 2016 | SIGMOD | 4.8924e-05 |
| 11,985 | Online Ordering of Overlapping Data Sources | 2014 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 127 | Querying Heterogeneous Information Sources Using Source Descriptions | 1996 | VLDB | 0.00044642203 |
| 253 | Query Caching and Optimization in Distributed Mediator Systems | 1996 | SIGMOD | 0.00030569863 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,590 | Answering Queries from Statistics and Probabilistic Views | 2005 | VLDB | 8.483194e-05 |
| 1,697 | A Probabilistic Framework for Vague Queries and Imprecise Information in Databases | 1990 | VLDB | 0.00010873473 |
| 1,992 | Probabilistic Ranking of Database Query Results | 2004 | VLDB | 9.8462684e-05 |
| 12,694 | A Case-Based Approach to Information Integration | 2000 | VLDB | 4.1945683e-05 |
| 1,858 | Bootstrapping Pay-As-You-Go Data Integration Systems | 2008 | SIGMOD | 0.00010301124 |
| 13,813 | Querying Partially Sound and Complete Data Sources | 2001 | PODS | - |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 13,602 | Information Discovery in Loosely Integrated Data | 2007 | SIGMOD | - |
| 721 | Data Integration with Uncertainty | 2007 | VLDB | 0.00017570539 |
| 2,432 | Computing Capabilities of Mediators | 1999 | SIGMOD | 8.8290243e-05 |