Web-scale Data Integration: You can only afford to Pay As You Go
Summary: Argues that traditional, tightly coupled data integration breaks down on web-scale sources (such as the Deep Web and Google Base) because of their extreme heterogeneity and sheer size. Proposes PAYGO, a dataspaces-inspired pay-as-you-go architecture that delivers incremental, best-effort, cost-aware integration to maximize utility under limited resources. (summarized by gpt-5-mini on Feb 09 2026)
Authors
1. Jayant Madhavan
2. Shawn R. Jeffery
3. Shirley Cohen
4. Xin (Luna) Dong
5. David Ko
6. Cong Yu
7. Alon Halevy
Incoming Citations (Sorted by Pagerank)
Showing 28 of 28 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,852 | Data Integration in the Large: The Challenge of Reuse | 1994 | VLDB | 4.1945683e-05 |
| 12,611 | Architectural Issues and Solutions in the Development of Data-Intensive Web Applications | 2003 | CIDR | 4.1945683e-05 |
| 2,730 | Open Data Integration | 2018 | VLDB | 8.2126735e-05 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 1.8355746e-04 |
| 6,586 | Web Data Management | 2011 | SIGMOD | 5.0023398e-05 |
| 1,851 | An Analysis of Structured Data on the Web | 2012 | VLDB | 1.0327871e-04 |
| 12,170 | Schema-As-You-Go: On Probabilistic Tagging and Querying of Wide Tables | 2011 | SIGMOD | 4.1945683e-05 |
| 3,724 | Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web | 2005 | CIDR | 6.8173288e-05 |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 1,858 | Bootstrapping Pay-As-You-Go Data Integration Systems | 2008 | SIGMOD | 1.0301124e-04 |