| 221 |
Deep Entity Matching with Pre-Trained Language Models |
2021 |
VLDB |
0.00033121824 |
| 1,914 |
Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks |
2020 |
SIGMOD |
0.00010109102 |
| 2,349 |
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation |
2021 |
VLDB |
8.9876423e-05 |
| 2,364 |
Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries |
2020 |
SIGMOD |
8.9554751e-05 |
| 3,140 |
ZeroER: Entity Resolution using Zero Labeled Examples |
2020 |
SIGMOD |
7.4841763e-05 |
| 3,396 |
Automatic Data Repair: Are We Ready to Deploy? |
2024 |
VLDB |
7.1455126e-05 |
| 3,400 |
ELPIS: Graph-Based Similarity Search for Scalable Data Science |
2023 |
VLDB |
7.1405533e-05 |
| 3,640 |
Deep Learning for Blocking in Entity Matching: A Design Space Exploration |
2021 |
VLDB |
6.8891671e-05 |
| 3,831 |
Kamino: Constraint-Aware Differentially Private Data Synthesis |
2021 |
VLDB |
6.7181688e-05 |
| 3,915 |
A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs |
2020 |
VLDB |
6.6332294e-05 |
| 3,942 |
Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins |
2022 |
VLDB |
6.6114622e-05 |
| 4,212 |
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration |
2023 |
SIGMOD |
6.3555142e-05 |
| 4,359 |
Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning |
2021 |
VLDB |
6.2569955e-05 |
| 4,731 |
Graph-Based Vector Search: An Experimental Evaluation of the State-of-the-Art |
2025 |
SIGMOD |
5.966659e-05 |
| 4,837 |
Entity Resolution with Hierarchical Graph Attention Networks |
2022 |
SIGMOD |
5.8892326e-05 |
| 4,967 |
Leva: Boosting Machine Learning Performance with Relational Embedding Data Augmentation |
2022 |
SIGMOD |
5.7956612e-05 |
| 5,282 |
Deep Indexed Active Learning for Matching Heterogeneous Entity Representations |
2022 |
VLDB |
5.5864206e-05 |
| 5,533 |
Dual-Objective Fine-Tuning of BERT for Entity Matching |
2021 |
VLDB |
5.4544359e-05 |
| 5,869 |
Demonstration of Panda: A Weakly Supervised Entity Matching System |
2021 |
VLDB |
5.2959029e-05 |
| 6,042 |
MDedup: Duplicate Detection with Matching Dependencies |
2020 |
VLDB |
5.2405269e-05 |
| 6,569 |
Domain Adaptation for Deep Entity Resolution |
2022 |
SIGMOD |
5.0065379e-05 |
| 6,690 |
Parallel Discrepancy Detection and Incremental Detection |
2021 |
VLDB |
4.9621556e-05 |
| 6,711 |
Analyzing How BERT Performs Entity Matching |
2022 |
VLDB |
4.9517546e-05 |
| 6,894 |
TableDC: Deep Clustering for Tabular Data |
2025 |
SIGMOD |
4.8925595e-05 |
| 7,052 |
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis |
2023 |
VLDB |
4.8497453e-05 |
| 7,613 |
ADnEV: Cross-Domain Schema Matching using Deep Similarity Matrix Adjustment and Evaluation |
2020 |
VLDB |
4.6961059e-05 |
| 8,005 |
Online Topic-Aware Entity Resolution Over Incomplete Data Streams |
2021 |
SIGMOD |
4.6081461e-05 |
| 8,099 |
Sparkly: A Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching |
2023 |
VLDB |
4.5859317e-05 |
| 8,153 |
Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation |
2022 |
VLDB |
4.574554e-05 |
| 8,406 |
DADER: Hands-Off Entity Resolution with Domain Adaptation |
2022 |
VLDB |
4.5220083e-05 |
| 8,908 |
Deep Active Alignment of Knowledge Graph Entities and Schemata |
2023 |
SIGMOD |
4.427232e-05 |
| 8,911 |
PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching |
2023 |
VLDB |
4.427232e-05 |
| 8,958 |
FlexER: Flexible Entity Resolution for Multiple Intents |
2023 |
SIGMOD |
4.4210635e-05 |
| 9,077 |
VerifAI: Verified Generative AI |
2024 |
CIDR |
4.4010762e-05 |
| 9,235 |
ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries |
2025 |
VLDB |
4.3690661e-05 |
| 9,355 |
Discovering Top-k Rules using Subjective and Objective Criteria |
2023 |
SIGMOD |
4.3514328e-05 |
| 9,402 |
CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models |
2024 |
SIGMOD |
4.3441378e-05 |
| 9,408 |
Experimental Analysis of Large-scale Learnable Vector Storage Compression |
2024 |
VLDB |
4.3441378e-05 |
| 9,409 |
Ground Truth Inference for Weakly Supervised Entity Matching |
2023 |
SIGMOD |
4.3441378e-05 |
| 9,434 |
Rock: Cleaning Data by Embedding ML in Logic Rules |
2024 |
SIGMOD |
4.3430376e-05 |
| 9,460 |
The Battleship Approach to the Low Resource Entity Matching Problem |
2023 |
SIGMOD |
4.3366491e-05 |
| 9,487 |
Making It Tractable to Catch Duplicates and Conflicts in Graphs |
2023 |
SIGMOD |
4.3341665e-05 |
| 9,683 |
Hierarchical Entity Resolution using an Oracle |
2022 |
SIGMOD |
4.3047774e-05 |
| 9,830 |
Towards Autonomous, Hands-Free Data Exploration |
2020 |
CIDR |
4.2751057e-05 |
| 9,846 |
HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs |
2025 |
VLDB |
4.2721228e-05 |
| 9,896 |
Towards Interpretable and Learnable Risk Analysis for Entity Resolution |
2020 |
SIGMOD |
4.2600049e-05 |
| 10,022 |
In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,040 |
3dSAGER: Geospatial Entity Resolution over 3D Objects |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,486 |
Rule-Based Graph Cleaning with GPUs on a Single Machine |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,499 |
Privacy and Accuracy-Aware AI/ML Model Deduplication |
2025 |
SIGMOD |
4.1945683e-05 |