Joint Unsupervised Structure Discovery and Information Extraction
Summary: JUDIE performs joint unsupervised extraction of semi-structured records from continuous text with no delimiters. Its Structure Discovery step groups labels into records by detecting frequent repetitions, which allows the extraction to be refined iteratively without supervision. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,690 | Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets | 2018 | SIGMOD | 6.8384476e-05 |
| 3,742 | TEGRA: Table Extraction by Global Record Alignment | 2015 | SIGMOD | 6.7966898e-05 |
| 6,135 | Extracting Logical Hierarchical Structure of HTML Documents Based on Headings | 2015 | VLDB | 5.1930114e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 427 | Automated Ranking of Database Query Results | 2003 | CIDR | 2.352637e-04 |
| 637 | Automatic segmentation of text into structured records | 2001 | SIGMOD | 1.8824614e-04 |
| 12,230 | ONDUX: On-Demand Unsupervised Learning for Information Extraction | 2010 | SIGMOD | 4.1945683e-05 |