Database Paper Browser

Back to papers

Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity

Summary: Rejects global-domain normalization; integrates heterogeneous databases via textual-similarity WHIRL. Efficient WHIRL; experiments show fast inference, competitive with hand-coded normalization, and outperforms exact matching under a plausible global domain. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
3014
Venue
SIGMOD
Year
1998
Pagerank
0.00041055843
Overall Rank
150 | 98.96%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 34 of 34 citing papers.

Rank Citing Paper Year Venue Pagerank
48 Data Integration: A Theoretical Perspective 2002 PODS 0.00069720859
74 Efficient Query Evaluation on Probabilistic Databases 2004 VLDB 0.00057857292
125 Approximate String Joins in a Database (Almost) for Free 2001 VLDB 0.00044847972
155 Robust and Efficient Fuzzy Match for Online Data Cleaning 2003 SIGMOD 0.00040637896
199 Declarative Data Cleaning: Language, Model, and Algorithms 2001 VLDB 0.00035041015
266 Efficient Exact Set-Similarity Joins 2006 VLDB 0.00029718727
280 Eliminating Fuzzy Duplicates in Data Warehouses 2002 VLDB 0.00029113044
394 An Adaptive Query Execution System for Data Integration* 1999 SIGMOD 0.00024460855
427 Automated Ranking of Database Query Results 2003 CIDR 0.0002352637
902 Statistical Schema Matching across Web Query Interfaces 2003 SIGMOD 0.00015486247
1,096 Minimal Probing: Supporting Expensive Predicates for Top-k Queries 2002 SIGMOD 0.00014120512
1,202 VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams 2007 VLDB 0.00013326298
1,533 Example-driven Design of Efficient Record Matching Queries 2007 VLDB 0.00011471971
1,992 Probabilistic Ranking of Database Query Results 2004 VLDB 9.8462684e-05
2,012 DB&IR: Both Sides Now (Extended Abstract) 2007 SIGMOD 9.7951657e-05
2,447 WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce 2003 VLDB 8.8037197e-05
2,599 Integrating DB and IR Technologies: What is the Sound of One Hand Clapping? * 2005 CIDR 8.4702307e-05
3,110 Learning to Create Data-Integrating Queries 2008 VLDB 7.5475982e-05
3,168 Query Containment for Data Integration Systems 2000 PODS 7.4508875e-05
3,267 Benchmarking Declarative Approximate Selection Predicates 2007 SIGMOD 7.3058429e-05
3,490 Leveraging Set Relations in Exact Set Similarity Join 2017 VLDB 7.0465856e-05
3,529 Merging the Results of Approximate Match Operations 2004 VLDB 7.0059524e-05
3,667 Querying Structured Text in an XML Database 2003 SIGMOD 6.8602249e-05
4,050 An Efficient Partition Based Method for Exact Set Similarity Joins 2016 VLDB 6.4953612e-05
4,137 Exploiting Content Redundancy for Web Information Extraction 2010 VLDB 6.4181549e-05
4,435 Sampling Dirty Data for Matching Attributes 2010 SIGMOD 6.1918164e-05
4,619 Crowd-Based Deduplication: An Adaptive Approach 2015 SIGMOD 6.0444854e-05
5,122 Providing Database-like Access to the Web Using Queries Based on Textual Similarity 1998 SIGMOD 5.6803757e-05
5,571 HAMSTER: Using Search Clicklogs for Schema and Taxonomy Matching 2009 VLDB 5.4283499e-05
6,792 Automatically Incorporating New Sources in Keyword Search-Based Data Integration 2010 SIGMOD 4.9249098e-05
7,256 Effective and Efficient Retrieval of Structured Entities 2020 VLDB 4.7869419e-05
7,374 Sharing Work in Keyword Search over Databases 2011 SIGMOD 4.7494134e-05
9,725 On Concise Set of Relative Candidate Keys 2014 VLDB 4.2945121e-05
12,656 The Denodo Data Integration Platform 2002 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers