Database Paper Browser

Measuring the Structural Similarity of Semistructured Documents Using Entropy

Summary: Proposes an entropy-based measure of structural similarity for semistructured documents. The structure of each document is extracted, and entropy is then computed using Ziv-Lempel or Ziv-Merhav crossparsing. The paper claims the first linear-time approach to this problem, with clustering results that rival existing methods. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 9512
Venue: VLDB
Year: 2007
Pagerank: 4.4803734e-05
Overall Rank: 8,632 | 39.95%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
3,758 | Keyword Search over Relational Databases: A Metadata Approach | 2011 | SIGMOD | 6.7824746e-05

Outgoing Citations (Sorted by Pagerank)

Showing 11 of 11 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers