Database Paper Browser

Back to papers

Enriching Relations with Additional Attributes for ER

Summary: Defines relation enrichment: select a few KG-derived attributes to augment a relation's schema to maximize entity-resolution accuracy; problem is intractable. Uses tuple–KG linking and a reinforcement-learning policy jointly trained with ER to extract diverse, low-null attributes incrementally, yielding up to 33% ER gain. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13527
Venue
VLDB
Year
2024
Pagerank
4.1945683e-05
Overall Rank
11,054 | 23.10%
DOI
10.14778/3681954.3681987

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 41 of 41 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
8 Extending the Data Base Relational Model to Capture More Meaning 1979 SIGMOD 0.0015385917
49 Consistent Query Answers in Inconsistent Databases 1999 PODS 0.00067660624
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
229 Reference Reconciliation in Complex Information Spaces 2005 SIGMOD 0.00032242633
280 Eliminating Fuzzy Duplicates in Data Warehouses 2002 VLDB 0.00029113044
300 Deep Learning for Entity Matching: A Design Space Exploration 2018 SIGMOD 0.00028441466
509 On Active Learning of Record Matching Packages 2010 SIGMOD 0.00021409518
513 TURL: Table Understanding through Representation Learning 2021 VLDB 0.00021288342
517 Can Foundation Models Wrangle Your Data? 2023 VLDB 0.00021169035
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
903 To Join or Not to Join? Thinking Twice about Joins before Feature Selection 2016 SIGMOD 0.0001547016
1,159 Towards Certain Fixes with Editing Rules and Master Data 2010 VLDB 0.00013592813
1,178 Table Union Search on Open Data 2018 VLDB 0.00013468118
1,187 JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes 2019 SIGMOD 0.00013443639
1,463 ARDA: Automatic Relational Data Augmentation for Machine Learning 2020 VLDB 0.00011869295
1,644 Finding Related Tables in Data Lakes for Interactive Data Science 2020 SIGMOD 0.00011041787
1,894 Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning 2020 VLDB 0.0001018378
2,589 DogmatiX Tracks down Duplicates in XML 2005 SIGMOD 8.4847146e-05
2,836 Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning 2023 VLDB 8.0443826e-05
3,000 SANTOS: Relationship-based Semantic Table Union Search 2023 SIGMOD 7.7462128e-05
3,394 Incremental Graph Computations: Doable and Undoable 2017 SIGMOD 7.1480446e-05
3,824 Correlation Sketches for Approximate Join-Correlation Queries 2021 SIGMOD 6.7260705e-05
4,129 Are Key-Foreign Key Joins Safe to Avoid when Learning High-Capacity Classifiers? 2018 VLDB 6.428887e-05
4,332 Missing Value Imputation on Multidimensional Time Series 2021 VLDB 6.2805243e-05
4,967 Leva: Boosting Machine Learning Performance with Relational Embedding Data Augmentation 2022 SIGMOD 5.7956612e-05
5,041 KBPearl: A Knowledge Base Population System Supported by Joint Entity and Relation Linking 2020 VLDB 5.741618e-05
5,192 Pattern Functional Dependencies for Data Cleaning 2020 VLDB 5.6375087e-05
5,460 Relative Information Completeness 2009 PODS 5.4957751e-05
5,691 Putting Things into Context: Rich Explanations for Query Answers using Join Graphs 2021 SIGMOD 5.3684557e-05
5,978 Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond 2021 SIGMOD 5.2453012e-05
6,042 MDedup: Duplicate Detection with Matching Dependencies 2020 VLDB 5.2405269e-05
6,449 Causal Data Integration 2023 VLDB 5.0587746e-05
6,690 Parallel Discrepancy Detection and Incremental Detection 2021 VLDB 4.9621556e-05
6,727 ORBITS: Online Recovery of Missing Values in Multiple Time Series Streams 2021 VLDB 4.9483604e-05
6,810 Record Linkage with Uniqueness Constraints and Erroneous Values 2010 VLDB 4.9203397e-05
6,892 Identifying Insufficient Data Coverage for Ordinal Continuous-Valued Attributes 2021 SIGMOD 4.8925683e-05
7,179 Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning 2023 VLDB 4.8078895e-05
7,634 ReStore - Neural Data Completion for Relational Databases 2021 SIGMOD 4.6911382e-05
8,911 PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching 2023 VLDB 4.427232e-05
11,223 Splitting Tuples of Mismatched Entities 2023 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Semantically Similar Papers