Type Classification of Semi-Structured Documents
Summary: Experimental vector-space classifier for automatic type inference of semi-structured documents; provides explicit typing to support object-oriented techniques. Targets high recall/precision, fast speed, and extensibility, with empirical evaluation of accuracy and performance. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Markus Tresch
- 2. Neal Palmer
- 3. Allen Luniewski
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 393 | From Structured Documents to Novel Query Facilities | 1994 | SIGMOD | 0.00024524092 |
| 2,292 | The Rufus System: Information Organization for Semi-Structured Data | 1993 | VLDB | 9.0904272e-05 |
| 2,569 | Optimizing Queries on Files | 1994 | SIGMOD | 8.5218077e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,949 | What do those weird XML types want, anyway? | 1999 | VLDB | - |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 8,632 | Measuring the Structural Similarity of Semistructured Documents Using Entropy | 2007 | VLDB | 4.4803734e-05 |
| 637 | Automatic segmentation of text into structured records | 2001 | SIGMOD | 0.00018824614 |
| 12,649 | Fast and accurate text classification via multiple linear discriminant projections | 2002 | VLDB | 4.1945683e-05 |
| 4,092 | Structured Annotations of Web Queries | 2010 | SIGMOD | 6.4561959e-05 |
| 3,485 | Using taxonomy, discriminants, and signatures for navigating in text databases | 1997 | VLDB | 7.0504959e-05 |
| 13,021 | The Type Concept in Office Document Retrieval | 1985 | VLDB | 4.1945683e-05 |
| 1,606 | Enhanced hypertext categorization using hyperlinks | 1998 | SIGMOD | 0.00011174873 |
| 1,163 | Extracting Schema from Semistructured Data | 1998 | SIGMOD | 0.00013577466 |