The Power of Two Min-Hashes for Similarity Search among Hierarchical Data Objects
Summary: Sketching/LSH for leaf-labeled hierarchical objects (weighted trees) using min-hash propagation to capture an EMD-like minimum-superimposition distance (set-of-sets view). Prove one propagated min-hash gives poor guarantees while two min-hashes suffice to obtain strong collision-separation properties for similarity search. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636 |
| 1,390 | Change Detection in Hierarchically Structured Information | 1996 | SIGMOD | 0.00012248349 |
| 4,406 | Approximate Matching of Hierarchical Data Using pq-Grams | 2005 | VLDB | 6.2141638e-05 |
Previous
Page 1 / 1
Next