Juneau: Data Lake Management for Jupyter
Summary: Juneau extends Jupyter as an instrumentation layer for data-lake management in collaborative labs. It enables indexing, search, and recommendations of complementary datasets, features, and training data to boost reuse and discovery in notebooks. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yi Zhang
- 2. Zachary G. Ives
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 1,751 | Auctus: A Dataset Search Engine for Data Discovery and Augmentation | 2021 | VLDB | 0.00010683295 |
| 4,774 | LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems | 2021 | SIGMOD | 5.9316087e-05 |
| 5,794 | Discovering Related Data At Scale | 2021 | VLDB | 5.3245122e-05 |
| 7,582 | LakeCompass: An End-to-End System for Data Maintenance, Search and Analysis in Data Lakes | 2024 | VLDB | 4.7046388e-05 |
| 10,836 | Data Discovery in Data Lakes: Operations, Indexes, Systems | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7 | Optimal Aggregation Algorithms for Middleware [Extended Abstract] | 2001 | PODS | 0.0015496097 |
| 1,178 | Table Union Search on Open Data | 2018 | VLDB | 0.00013468118 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,958 | Exemplar Queries: Give me an Example of What You Need | 2014 | VLDB | 9.9572632e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 939 | Data Lake Management: Challenges and Opportunities | 2019 | VLDB | 0.00015187344 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 3,252 | Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks | 2020 | SIGMOD | 7.3178277e-05 |
| 13,291 | Towards Understanding Data Analysis Workflows using a Large Notebook Corpus | 2019 | SIGMOD | - |
| 11,316 | Kyrix-J: Visual Discovery of Connected Datasets in a Data Lake | 2022 | CIDR | 4.1945683e-05 |
| 3,347 | Collaborative Data Analytics with DataHub | 2015 | VLDB | 7.1921364e-05 |
| 13,230 | Automating State Management in Computational Notebooks | 2021 | CIDR | - |
| 11,063 | Searching Data Lakes for Nested and Joined Data | 2024 | VLDB | 4.1945683e-05 |
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 6,981 | Dataset Relationship Management | 2019 | CIDR | 4.8743957e-05 |