Database Paper Browser

Back to papers

DogmatiX Tracks down Duplicates in XML

Summary: DogmatiX extends duplicate detection to XML with a general framework (candidate, duplicate, detection). It uses XML-aware similarity that blends value similarity with structural cues from parents and children, plus XML-specific heuristics, validated empirically. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
3655
Venue
SIGMOD
Year
2005
Pagerank
8.4847146e-05
Overall Rank
2,589 | 82.00%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 12 of 12 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
67 The Merge/Purge Problem for Large Databases 1995 SIGMOD 0.00061348205
112 Potter's Wheel: An Interactive Data Cleaning System 2001 VLDB 0.00047045036
199 Declarative Data Cleaning: Language, Model, and Algorithms 2001 VLDB 0.00035041015
280 Eliminating Fuzzy Duplicates in Data Warehouses 2002 VLDB 0.00029113044
2,784 Approximate XML Joins 2002 SIGMOD 8.128931e-05
Previous Page 1 / 1 Next

Semantically Similar Papers