Web Data Management
Summary: Survey of Web Data Management (WDM) techniques fusing Web-scale structured data (tables, lists, forms), crowdsourced KBs (Wikipedia, DBpedia, YAGO, Freebase), and Web text to extract facts. Shows how cross-dataset fragments enable novel results (schema thesaurus construction) and discusses collaboration implications of Web-scale data sourcing. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,494 | Incomplete Data: What Went Wrong, and How to Fix It | 2014 | PODS | 5.4759469e-05 |
| 5,717 | Query Processing under GLAV Mappings for Relational and Graph Databases | 2013 | VLDB | 5.3553228e-05 |
| 11,975 | Which Concepts Are Worth Extracting? | 2014 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 62 | Freebase: A Collaboratively Created Graph Database For Structuring Human Knowledge | 2008 | SIGMOD | 0.0006429466 |
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 364 | Annotating and Searching Web Tables Using Entities, Types and Relationships | 2010 | VLDB | 0.00025637562 |
| 2,066 | DBLife: A Community Information Management Platform for the Database Research Community | 2007 | CIDR | 9.6399561e-05 |
| 4,229 | Harnessing the Deep Web: Present and Future | 2009 | CIDR | 6.3399547e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,927 | Managing Web Data | 1999 | SIGMOD | - |
| 12,114 | Database Techniques for Linked Data Management | 2012 | SIGMOD | 4.1945683e-05 |
| 5,933 | Capturing and Querying Multiple Aspects of Semistructured Data | 1999 | VLDB | 5.2668503e-05 |
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 12,369 | Effective and Efficient Semantic Web Data Management over DB2 | 2008 | SIGMOD | 4.1945683e-05 |
| 3,155 | Ten Years of WebTables | 2018 | VLDB | 7.4672742e-05 |
| 1,851 | An Analysis of Structured Data on the Web | 2012 | VLDB | 0.00010327871 |
| 13,588 | Databases on the Web | 2007 | SIGMOD | - |
| 12,401 | Large-Scale Collaborative Analysis and Extraction of Web Data | 2008 | VLDB | 4.1945683e-05 |
| 13,820 | Data Management: Lasting Impact of the Wild, Wild, Web | 2001 | SIGMOD | - |