DataDiff: User-Interpretable Data Transformation Summaries for Collaborative Data Analysis
Summary: DataDiff provides user-interpretable data-diff summaries for collaborative dataset versioning. It yields concise explanations of changes without dependence on the originating operations, aiding merge, conflict detection, and evolution reasoning. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Gunce Su Yilmaz
- 2. Tana Wattanawaroon
- 3. Liqi Xu
- 4. Abhishek Nigam
- 5. Aaron J. Elmore
- 6. Aditya Parameswaran
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,280 | Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V | 2023 | VLDB | 5.5896735e-05 |
| 11,216 | Demystifying the QoS and QoE of Edge-hosted Video Streaming Applications in the Wild with SNESet | 2023 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 492 | Query by Output | 2009 | SIGMOD | 0.00021974699 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 1,572 | Reverse Engineering Complex Join Queries | 2013 | SIGMOD | 0.00011298251 |
| 2,037 | OrpheusDB: Bolt-on Versioning for Relational Databases | 2017 | VLDB | 9.7120139e-05 |
| 2,269 | Ground: A Data Context Service | 2017 | CIDR | 9.147379e-05 |
| 2,430 | Decibel: The Relational Dataset Branching System | 2016 | VLDB | 8.8330417e-05 |
| 2,750 | Learning and Verifying Quantified Boolean Queries by Example | 2013 | PODS | 8.176296e-05 |
| 2,913 | Performance Evaluation of a Temporal Database Management System | 1986 | SIGMOD | 7.9126252e-05 |
| 3,911 | The BT-Tree: A Branched and Temporal Access Method | 2000 | VLDB | 6.6359583e-05 |
| 7,254 | DEX: Query Execution in a Delta-based Storage System | 2017 | SIGMOD | 4.7885915e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,749 | Efficient Differential Dependency Discovery | 2024 | VLDB | 4.2897489e-05 |
| 13,280 | Effective Data Versioning for Collaborative Data Analytics | 2020 | SIGMOD | - |
| 11,319 | Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration | 2022 | CIDR | 4.1945683e-05 |
| 10,875 | SDEcho: Efficient Explanation of Aggregated Sequence Difference | 2025 | VLDB | 4.1945683e-05 |
| 13,686 | Efficient development of data migration transformations | 2004 | SIGMOD | - |
| 1,565 | Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff | 2015 | VLDB | 0.00011345567 |
| 11,297 | DataRinse: Semantic Transforms for Data preparation based on Code Mining | 2023 | VLDB | 4.1945683e-05 |
| 2,154 | DIFF: A Relational Interface for Large-Scale Data Explanation | 2019 | VLDB | 9.4208667e-05 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 5,280 | Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V | 2023 | VLDB | 5.5896735e-05 |