Database Paper Browser

Back to papers

The Power of Two Min-Hashes for Similarity Search among Hierarchical Data Objects

Summary: Sketching/LSH for leaf-labeled hierarchical objects (weighted trees) using min-hash propagation to capture an EMD-like minimum-superimposition distance (set-of-sets view). Prove one propagated min-hash gives poor guarantees while two min-hashes suffice to obtain strong collision-separation properties for similarity search. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
1461
Venue
PODS
Year
2008
Pagerank
4.1945683e-05
Overall Rank
12,357 | 14.04%
DOI
-

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
34 Similarity Search in High Dimensions via Hashing 1999 VLDB 0.00076637636
1,390 Change Detection in Hierarchically Structured Information 1996 SIGMOD 0.00012248349
4,406 Approximate Matching of Hierarchical Data Using pq-Grams 2005 VLDB 6.2141638e-05
Previous Page 1 / 1 Next

Semantically Similar Papers