Database Paper Browser

XTRACT: A System for Extracting Document Type Descriptors from XML Documents

Summary: XTRACT infers concise, meaningful DTDs by generalizing repeated sequences into regular expressions and factoring the resulting candidates with optimization techniques. It then uses the Minimum Description Length (MDL) principle to pick the best DTD. Experiments on real and synthetic XML data show that the extraction is scalable and accurate. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 3182
Venue: SIGMOD
Year: 2000
Pagerank: 0.00014799689
Overall Rank: 992 | 93.11%
DOI: -

Incoming Non-self Citations Over Time

Incoming Citations (Sorted by Pagerank)

Showing 12 of 12 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers