Crossing the finish line faster when paddling the Data Lake with KAYAK
Summary: KAYAK offers ad-hoc primitives and incremental execution that generate informative previews during data preparation in data lakes. Its metadata-centric design minimizes human effort and scales as the lake evolves, shortening the time to insight. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 610 | Goods: Organizing Google's Datasets | 2016 | SIGMOD | 0.00019232674 |
| 818 | Finding Related Tables | 2012 | SIGMOD | 0.00016311524 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,625 | Data Profiling with Metanome | 2015 | VLDB | 0.00011094926 |
| 1,833 | Data Wrangling: The Challenging Journey from the Wild to the Lake | 2015 | CIDR | 0.00010378976 |
| 2,269 | Ground: A Data Context Service | 2017 | CIDR | 9.147379e-05 |
| 3,281 | Constance: An Intelligent Data Lake System | 2016 | SIGMOD | 7.2823287e-05 |
| 3,347 | Collaborative Data Analytics with DataHub | 2015 | VLDB | 7.1921364e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,907 | Petabyte-Scale Row-Level Operations in Data Lakehouses | 2024 | VLDB | 4.6205839e-05 |
| 13,201 | Examples are All You Need: Iterative Data Discovery by Example in Data Lakes | 2022 | CIDR | - |
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 11,063 | Searching Data Lakes for Nested and Joined Data | 2024 | VLDB | 4.1945683e-05 |
| 7,059 | Adaptive and Robust Query Execution for Lakehouses at Scale | 2024 | VLDB | 4.8477825e-05 |
| 3,690 | Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets | 2018 | SIGMOD | 6.8384476e-05 |
| 13,277 | The Challenge of Building Effective Data Lakes | 2020 | SIGMOD | - |
| 10,685 | LakeVisage: Towards Scalable, Flexible and Interactive Visualization Recommendation for Data Discovery over Data Lakes | 2025 | VLDB | 4.1945683e-05 |
| 1,833 | Data Wrangling: The Challenging Journey from the Wild to the Lake | 2015 | CIDR | 0.00010378976 |
| 11,732 | CoreKG: a Knowledge Lake Service | 2018 | VLDB | 4.1945683e-05 |