Database Paper Browser

Back to papers

Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V

Summary: Explain-Da-V: framework that explains semantic changes between dataset versions by synthesizing concise data-transformation explanations. Proposes validity, generalizability and explainability metrics and empirically outperforms prior transformation-synthesis methods. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13020
Venue
VLDB
Year
2023
Pagerank
5.5896735e-05
Overall Rank
5,280 | 63.27%
DOI
10.14778/3583140.3583169

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 30 of 30 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
728 Meaningful Change Detection in Structured Data 1997 SIGMOD 0.00017494982
939 Data Lake Management: Challenges and Opportunities 2019 VLDB 0.00015187344
1,047 Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms 2015 VLDB 0.00014459715
1,099 Interpretable and Informative Explanations of Outcomes 2015 VLDB 0.00014096312
1,187 JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes 2019 SIGMOD 0.00013443639
1,267 Foofah: Transforming Data By Example 2017 SIGMOD 0.00012936483
1,281 DataHub: Collaborative Data Science & Dataset Version Management at Scale 2015 CIDR 0.00012854744
1,390 Change Detection in Hierarchically Structured Information 1996 SIGMOD 0.00012248349
1,469 BlinkFill: Semi-supervised Programming By Example for Syntactic String Transformations 2016 VLDB 0.00011836053
1,565 Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff 2015 VLDB 0.00011345567
2,037 OrpheusDB: Bolt-on Versioning for Relational Databases 2017 VLDB 9.7120139e-05
2,730 Open Data Integration 2018 VLDB 8.2126735e-05
3,230 Learning Semantic String Transformations from Examples 2012 VLDB 7.339123e-05
3,252 Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks 2020 SIGMOD 7.3178277e-05
3,478 Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations 2018 VLDB 7.054159e-05
3,690 Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets 2018 SIGMOD 6.8384476e-05
3,735 Auto-Join: Joining Tables by Leveraging Transformations 2017 VLDB 6.8061318e-05
4,859 Integrating Data Lake Tables 2023 VLDB 5.8732433e-05
5,096 Auto-Transform: Learning-to-Transform by Patterns 2020 VLDB 5.7011825e-05
5,242 Towards Benchmarking Feature Type Inference for AutoML Platforms 2021 SIGMOD 5.6074743e-05
5,383 Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search 2021 VLDB 5.5393038e-05
5,506 Exploring Change – A New Dimension of Data Analytics 2019 VLDB 5.473324e-05
6,475 Explain3D: Explaining Disagreements in Disjoint Datasets 2019 VLDB 5.0497183e-05
6,679 SQUARES : A SQL Synthesizer Using Query Reverse Engineering 2020 VLDB 4.9656458e-05
7,613 ADnEV: Cross-Domain Schema Matching using Deep Similarity Matrix Adjustment and Evaluation 2020 VLDB 4.6961059e-05
7,994 TardisDB: Extending SQL to Support Versioning 2021 SIGMOD 4.61099e-05
8,338 DBChEx: Interactive Exploration of Data and Schema Change 2019 CIDR 4.5434254e-05
8,958 FlexER: Flexible Entity Resolution for Multiple Intents 2023 SIGMOD 4.4210635e-05
9,076 DataDiff: User-Interpretable Data Transformation Summaries for Collaborative Data Analysis 2018 SIGMOD 4.401804e-05
Previous Page 1 / 1 Next

Semantically Similar Papers