Back to papers
Lenses: An On-Demand Approach to ETL
Summary: General, extensible on-demand ETL using probabilistic query processing for incremental data curation and quality–effort trade-offs. UI and a greedy CPI ranking for on-demand curation tasks; shows applicability beyond Paygo/HLog and ETL feasibility.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 11035
- Venue
- VLDB
- Year
- 2015
- Pagerank
- 5.3307398e-05
- Overall Rank
- 5,779 | 59.80%
- DOI
-
-
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 2,573 |
Query Optimization for Dynamic Imputation |
2017 |
VLDB |
8.518235e-05 |
| 4,426 |
Data Debugging and Exploration with Vizier |
2019 |
SIGMOD |
6.1969994e-05 |
| 4,664 |
Efficient Answering of Historical What-if Queries |
2022 |
SIGMOD |
6.0127053e-05 |
| 4,806 |
Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers |
2019 |
SIGMOD |
5.9092698e-05 |
| 6,295 |
Your notebook is not crumby enough, REPLace it |
2020 |
CIDR |
5.1249204e-05 |
| 7,941 |
Efficient Uncertainty Tracking for Complex Queries with Attribute-level Bounds |
2021 |
SIGMOD |
4.613363e-05 |
| 8,340 |
Beta Probabilistic Databases: A Scalable Approach to Belief Updating and Parameter Learning |
2017 |
SIGMOD |
4.5433598e-05 |
| 9,044 |
Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain Data |
2023 |
VLDB |
4.4039656e-05 |
| 9,851 |
Adaptive Schema Databases |
2017 |
CIDR |
4.2721228e-05 |
| 10,377 |
FastPDB: Towards Bag-Probabilistic Queries at Interactive Speeds |
2025 |
SIGMOD |
4.1945683e-05 |
| 11,069 |
Hardware-Efficient Data Imputation through DBMS Extensibility |
2024 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 14 |
Online Aggregation |
1997 |
SIGMOD |
0.0010801504 |
| 31 |
Provenance Semirings |
2007 |
PODS |
0.0007857786 |
| 299 |
Trio: A System for Data, Uncertainty, and Lineage |
2006 |
VLDB |
0.00028525071 |
| 321 |
MCDB: A Monte Carlo Approach to Managing Uncertain Data |
2008 |
SIGMOD |
0.00027527389 |
| 494 |
Data Exchange: Getting to the Core |
2003 |
PODS |
0.00021805832 |
| 692 |
Pay-as-you-go User Feedback for Dataspace Systems |
2008 |
SIGMOD |
0.00018083948 |
| 706 |
MYSTIQ: A system for finding more answers by using probabilities |
2005 |
SIGMOD |
0.00017845469 |
| 980 |
BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models |
2008 |
VLDB |
0.00014879747 |
| 2,331 |
Orion 2.0: Native Support for Uncertain Data |
2008 |
SIGMOD |
9.018559e-05 |
| 2,875 |
MayBMS: A Probabilistic Database Management System |
2009 |
SIGMOD |
7.9742313e-05 |
| 2,984 |
Efficiently Incorporating User Feedback into Information Extraction and Integration Programs |
2009 |
SIGMOD |
7.7796344e-05 |
| 3,051 |
Partial Results in Database Systems |
2014 |
SIGMOD |
7.6512591e-05 |
| 6,355 |
User Feedback as a First Class Citizen in Information Integration Systems |
2011 |
CIDR |
5.0987661e-05 |
| 6,858 |
When is Naive Evaluation Possible? |
2013 |
PODS |
4.9060157e-05 |
| 7,224 |
OASSIS: Query Driven Crowd Mining |
2014 |
SIGMOD |
4.7959024e-05 |
| 7,553 |
SPROUT2: A Squared Query Engine for Uncertain Web Data |
2011 |
SIGMOD |
4.7126455e-05 |
| 7,787 |
Jigsaw: Efficient Optimization Over Uncertain Enterprise Data |
2011 |
SIGMOD |
4.6512526e-05 |
| 9,138 |
Management of Flexible Schema Data in RDBMSs - Opportunities and Limitations for NoSQL |
2015 |
CIDR |
4.3869509e-05 |
| 9,359 |
IQ: The Case for Iterative Querying for Knowledge |
2011 |
CIDR |
4.3509599e-05 |
Semantically Similar Papers