Extracting and Querying a Comprehensive Web Database
Summary: Omnivore builds a comprehensive web-scale entity-relationship DB by running multiple domain-independent extractors (tables, text, relations) in parallel over a crawl and merging heterogeneous outputs to overcome model-specific blind spots. Provides SQL-like and search interfaces, supports user corrections, and automatically selects output model/schema to render results without prior metadata. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,652 | From Information to Knowledge: Harvesting Entities and Relationships from Web Sources | 2010 | PODS | 5.3903671e-05 |
| 9,359 | IQ: The Case for Iterative Querying for Knowledge | 2011 | CIDR | 4.3509599e-05 |
| 12,044 | Knowledge Harvesting in the Big-Data Era | 2013 | SIGMOD | 4.1945683e-05 |
| 12,244 | DoCQS: A Prototype System for Supporting Data-oriented Content Query | 2010 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 62 | Freebase: A Collaboratively Created Graph Database For Structuring Human Knowledge | 2008 | SIGMOD | 0.0006429466 |
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 188 | Applying Model Management to Classical Meta Data Problems | 2003 | CIDR | 0.00035968389 |
| 229 | Reference Reconciliation in Complex Information Spaces | 2005 | SIGMOD | 0.00032242633 |
| 1,395 | Structured Querying of Web Text: A Technical Challenge | 2007 | CIDR | 0.00012207039 |
| 1,722 | Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach | 2007 | VLDB | 0.00010757784 |
| 1,980 | Snowball: A Prototype System for Extracting Relations from Large Text Collections | 2001 | SIGMOD | 9.8785341e-05 |
Previous
Page 1 / 1
Next