Dataset Relationship Management
Summary: Proposes a Dataset Relationship Management System (DRMS) to manage operations at the level of datasets and data products: reuse, provenance and context, rapid revision, retargeting, and contribution metrics across many datasets. Advocates using Jupyter notebooks to capture fine-grained dataset provenance and describes the JuNEAU prototype. (summarized by gpt-5-mini on Feb 09 2026)
Authors
1. Zachary G. Ives
2. Yi Zhang
3. Soonbo Han
4. Nan Zheng
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,053 | Optimizing Machine Learning Workloads in Collaborative Environments | 2020 | SIGMOD | 5.2326838e-05 |
| 10,820 | APEX-DAG: Library and Language independent Pipeline EXtraction | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,524 | Provenance Management in Curated Databases | 2006 | SIGMOD | 8.6017899e-05 |
| 11,396 | DPDS: Assisting Data Science with Data Provenance | 2022 | VLDB | 4.1945683e-05 |
| 13,364 | Data Visualization Management Systems | 2015 | CIDR | - |
| 711 | A Case for A Collaborative Query Management System | 2009 | CIDR | 0.00017751589 |
| 13,291 | Towards Understanding Data Analysis Workflows using a Large Notebook Corpus | 2019 | SIGMOD | - |
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 11,319 | Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration | 2022 | CIDR | 4.1945683e-05 |
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 4,595 | Juneau: Data Lake Management for Jupyter | 2019 | VLDB | 6.060188e-05 |