Database Paper Browser

BibFinder/StatMiner: Effectively Mining and Using Coverage and Overlap Statistics in Data Integration

Summary: StatMiner learns coverage and overlap statistics for data integration, using a hierarchical classifier and threshold-based learning to adapt the resolution of the learned statistics. It is demonstrated on BibFinder, a setting with autonomous sources of uneven coverage, where the learned statistics speed up query processing. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 9064
Venue: VLDB
Year: 2003
Pagerank: 5.4160529e-05
Overall Rank: 5,600 | 61.05%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
11,985 | Online Ordering of Overlapping Data Sources | 2014 | VLDB | 4.1945683e-05
12,178 | Large-Scale Copy Detection | 2011 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
36 | Fast Algorithms for Mining Association Rules | 1994 | VLDB | 0.00076161096
1,289 | Using Probabilistic Information in Data Integration | 1997 | VLDB | 0.00012804879
3,170 | Quality-driven Integration of Heterogeneous Information Systems | 1999 | VLDB | 7.4482367e-05
