Towards an Objective Metric for Data Value Through Relevance
Summary: The paper introduces an objective metric for data value based on a formalized notion of 'data relevance', together with efficient computation and maintenance of that metric for evolving tabular and semi-structured datasets. It applies relevance-driven value to prioritize storage and query layout, to guide curation, pricing, and catalogs, and to manage 'dark data' and workload–data interactions. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Boris Glavic
2. Pengyuan Li
3. Ziyu Liu
4. Dieter Gawlick
5. Vasudha Krishnaswamy
6. Danica Porobic
7. Zhen Hua Liu
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Overall Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 61 | DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases | 1997 | VLDB | 6.4329285e-04 |
| 610 | Goods: Organizing Google's Datasets | 2016 | SIGMOD | 1.9232674e-04 |
| 939 | Data Lake Management: Challenges and Opportunities | 2019 | VLDB | 1.5187344e-04 |
| 2,359 | Data Market Platforms: Trading Data Assets to Solve Data Problems | 2020 | VLDB | 8.9607667e-05 |
| 2,743 | Toward Practical Query Pricing with QueryMarket | 2013 | SIGMOD | 8.1897331e-05 |
| 2,764 | The Semiring Framework for Database Provenance | 2017 | PODS | 8.1574444e-05 |
| 4,361 | The Complexity of Resilience and Responsibility for Self-Join-Free Conjunctive Queries | 2016 | VLDB | 6.2559141e-05 |
| 8,886 | Provenance-based Data Skipping | 2022 | VLDB | 4.4279829e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,529 | Data-Driven Domain Discovery for Structured Datasets | 2020 | VLDB | 5.4566641e-05 |
| 10,624 | Evaluating Methods for Efficient Entity Count Estimation | 2025 | VLDB | 4.1945683e-05 |
| 507 | Data Quality and Data Cleaning: An Overview | 2003 | SIGMOD | 2.1473263e-04 |
| 5,794 | Discovering Related Data At Scale | 2021 | VLDB | 5.3245122e-05 |
| 12,355 | Information Theory For Data Management | 2009 | VLDB | 4.1945683e-05 |
| 2,832 | Intensional Associations Between Data and Metadata | 2007 | SIGMOD | 8.050082e-05 |
| 1,510 | Summarizing Relational Databases | 2009 | VLDB | 1.1606901e-04 |
| 9,175 | Efficient Exploration of Interesting Aggregates in RDF Graphs | 2021 | SIGMOD | 4.383548e-05 |
| 10,501 | Relevance Queries for Interval Data | 2025 | SIGMOD | 4.1945683e-05 |
| 12,436 | K-Relevance: A Spectrum of Relevance for Data Sources Impacting a Query | 2007 | SIGMOD | 4.1945683e-05 |