| 754 |
Distributed Representations of Tuples for Entity Resolution |
2018 |
VLDB |
0.00017117211 |
| 1,012 |
NADEEF: A Commodity Data Cleaning System |
2013 |
SIGMOD |
0.0001464733 |
| 1,159 |
Towards Certain Fixes with Editing Rules and Master Data |
2010 |
VLDB |
0.00013592813 |
| 1,277 |
The Data Civilizer System |
2017 |
CIDR |
0.00012879695 |
| 1,414 |
Graph Pattern Matching: From Intractable to Polynomial Time |
2010 |
VLDB |
0.00012118275 |
| 1,541 |
Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes |
2023 |
CIDR |
0.00011456579 |
| 1,546 |
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing |
2015 |
SIGMOD |
0.00011446851 |
| 1,612 |
Detecting Data Errors: Where are we and what needs to be done? |
2016 |
VLDB |
0.00011142794 |
| 1,831 |
Synthesizing Entity Matching Rules by Examples |
2018 |
VLDB |
0.00010384082 |
| 1,892 |
Querying Shortest Paths on Time Dependent Road Networks |
2019 |
VLDB |
0.00010185573 |
| 2,349 |
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation |
2021 |
VLDB |
8.9876423e-05 |
| 2,607 |
Graph Stream Summarization: From Big Bang to Big Crunch |
2016 |
SIGMOD |
8.4630211e-05 |
| 2,823 |
Interaction between Record Matching and Data Repairing |
2011 |
SIGMOD |
8.0593894e-05 |
| 2,945 |
Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning |
2023 |
SIGMOD |
7.8377395e-05 |
| 2,946 |
BigDansing: A System for Big Data Cleansing |
2015 |
SIGMOD |
7.8372441e-05 |
| 2,968 |
Raha: A Configuration-Free Error Detection System |
2019 |
SIGMOD |
7.7985097e-05 |
| 3,192 |
Towards Dependable Data Repairing with Fixing Rules |
2014 |
SIGMOD |
7.4095761e-05 |
| 3,265 |
RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! - |
2018 |
VLDB |
7.3083672e-05 |
| 3,449 |
Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation |
2022 |
VLDB |
7.0824319e-05 |
| 3,571 |
Lightning Fast and Space Efficient Inequality Joins |
2015 |
VLDB |
6.9580858e-05 |
| 3,582 |
NADEEF/ER: Generic and Interactive Entity Resolution |
2014 |
SIGMOD |
6.9479263e-05 |
| 3,640 |
Deep Learning for Blocking in Entity Matching: A Design Space Exploration |
2021 |
VLDB |
6.8891671e-05 |
| 3,662 |
The Dawn of Natural Language to SQL: Are We Fully Ready? |
2024 |
VLDB |
6.8672143e-05 |
| 3,861 |
Generating Concise Entity Matching Rules |
2017 |
SIGMOD |
6.6878164e-05 |
| 3,970 |
HAIChart: Human and AI Paired Visualization System |
2024 |
VLDB |
6.5784767e-05 |
| 3,976 |
UGuide – User-Guided Discovery of FD-Detectable Errors |
2017 |
SIGMOD |
6.5736462e-05 |
| 4,102 |
GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data |
2023 |
SIGMOD |
6.4522929e-05 |
| 4,212 |
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration |
2023 |
SIGMOD |
6.3555142e-05 |
| 4,825 |
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks |
2021 |
SIGMOD |
5.8946721e-05 |
| 4,908 |
Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL |
2024 |
VLDB |
5.8339245e-05 |
| 5,028 |
Adaptive Data Augmentation for Supervised Learning over Missing Data |
2021 |
VLDB |
5.7506746e-05 |
| 5,058 |
A Demo of the Data Civilizer System |
2017 |
SIGMOD |
5.7280139e-05 |
| 5,192 |
Pattern Functional Dependencies for Data Cleaning |
2020 |
VLDB |
5.6375087e-05 |
| 5,205 |
ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies |
2019 |
SIGMOD |
5.630869e-05 |
| 5,381 |
Selective Data Acquisition in the Wild for Model Charging |
2022 |
VLDB |
5.5399508e-05 |
| 5,462 |
RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes |
2024 |
VLDB |
5.494769e-05 |
| 5,469 |
Learned Cardinality Estimation for Similarity Queries |
2021 |
SIGMOD |
5.4898192e-05 |
| 5,484 |
DeepEye: Creating Good Data Visualizations by Keyword Search |
2018 |
SIGMOD |
5.4826544e-05 |
| 5,684 |
Dagger: A Data (not code) Debugger |
2020 |
CIDR |
5.3720749e-05 |
| 5,729 |
KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing |
2015 |
VLDB |
5.3506368e-05 |
| 5,963 |
Automatic Data Acquisition for Deep Learning |
2021 |
VLDB |
5.2526794e-05 |
| 6,280 |
Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks |
2023 |
VLDB |
5.1290457e-05 |
| 6,350 |
NADEEF: A Generalized Data Cleaning System |
2013 |
VLDB |
5.101815e-05 |
| 6,569 |
Domain Adaptation for Deep Entity Resolution |
2022 |
SIGMOD |
5.0065379e-05 |
| 6,765 |
Automatic Database Configuration Debugging using Retrieval-Augmented Language Models |
2025 |
SIGMOD |
4.9325583e-05 |
| 6,842 |
Towards Democratizing Relational Data Visualization |
2019 |
SIGMOD |
4.9103931e-05 |
| 7,179 |
Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning |
2023 |
VLDB |
4.8078895e-05 |
| 7,582 |
LakeCompass: An End-to-End System for Data Maintenance, Search and Analysis in Data Lakes |
2024 |
VLDB |
4.7046388e-05 |
| 8,000 |
Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics |
2019 |
VLDB |
4.6092803e-05 |
| 8,116 |
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes |
2024 |
VLDB |
4.581507e-05 |