ActiveDeeper: A Model-based Active Data Enrichment System
Summary: ActiveDeeper uses the deep web as a labeler to train a data-enrichment model for local databases. Model-driven enrichment from deep-web data outperforms state-of-the-art in real-world settings; delivered as a Google Sheets add-on. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Liang Zhao
- 2. Qingcan Li
- 3. Pei Wang
- 4. Jiannan Wang
- 5. Eugene Wu
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,434 | Rock: Cleaning Data by Embedding ML in Logic Rules | 2024 | SIGMOD | 4.3430376e-05 |
| 10,029 | Outliers: The Good, the Bad and the Ugly | 2026 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 420 | InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables | 2012 | SIGMOD | 0.00023719065 |
| 3,229 | InfoGather+: Semantic Matching and Annotation of Numeric and Time-Varying Attributes in Web Tables | 2013 | SIGMOD | 7.3393682e-05 |
| 8,678 | Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment | 2019 | SIGMOD | 4.4702119e-05 |
| 11,722 | Deeper: A Data Enrichment System Powered by Deep Web | 2018 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,229 | Harnessing the Deep Web: Present and Future | 2009 | CIDR | 6.3399547e-05 |
| 8,154 | MetaQuerier: Querying Structured Web Sources On-the-fly | 2005 | SIGMOD | 4.5745458e-05 |
| 667 | Incremental Knowledge Base Construction Using DeepDive | 2015 | VLDB | 0.00018440557 |
| 2,095 | Knocking the Door to the Deep Web: Integrating Web Query Interfaces | 2004 | SIGMOD | 9.5505068e-05 |
| 4,758 | Optimization for Active Learning-based Interactive Database Exploration | 2019 | VLDB | 5.9422515e-05 |
| 1,537 | Google's Deep-Web Crawl | 2008 | VLDB | 0.00011465704 |
| 4,106 | Extracting Databases from Dark Data with DeepDive | 2016 | SIGMOD | 6.4456184e-05 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746 |
| 8,678 | Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment | 2019 | SIGMOD | 4.4702119e-05 |
| 11,722 | Deeper: A Data Enrichment System Powered by Deep Web | 2018 | SIGMOD | 4.1945683e-05 |