| 754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211 |
| 833 | Guided Data Repair | 2011 | VLDB | 0.00016138432 |
| 1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851 |
| 1,612 | Detecting Data Errors: Where are we and what needs to be done? | 2016 | VLDB | 0.00011142794 |
| 2,349 | RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation | 2021 | VLDB | 8.9876423e-05 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 2,968 | Raha: A Configuration-Free Error Detection System | 2019 | SIGMOD | 7.7985097e-05 |
| 3,130 | Behavior Based Record Linkage | 2010 | VLDB | 7.4993061e-05 |
| 3,265 | RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! - | 2018 | VLDB | 7.3083672e-05 |
| 3,571 | Lightning Fast and Space Efficient Inequality Joins | 2015 | VLDB | 6.9580858e-05 |
| 3,582 | NADEEF/ER: Generic and Interactive Entity Resolution | 2014 | SIGMOD | 6.9479263e-05 |
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 3,713 | GDR: A System for Guided Data Repair | 2010 | SIGMOD | 6.8224341e-05 |
| 3,976 | UGuide – User-Guided Discovery of FD-Detectable Errors | 2017 | SIGMOD | 6.5736462e-05 |
| 4,650 | LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data | 2016 | VLDB | 6.0234336e-05 |
| 4,695 | DataXFormer: An Interactive Data Transformation Tool | 2015 | SIGMOD | 5.9927993e-05 |
| 4,904 | Temporal Rules Discovery for Web Data Cleaning | 2016 | VLDB | 5.8399195e-05 |
| 5,058 | A Demo of the Data Civilizer System | 2017 | SIGMOD | 5.7280139e-05 |
| 5,153 | Horizon: Scalable Dependency-driven Data Cleaning | 2021 | VLDB | 5.6607963e-05 |
| 5,192 | Pattern Functional Dependencies for Data Cleaning | 2020 | VLDB | 5.6375087e-05 |
| 5,205 | ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies | 2019 | SIGMOD | 5.630869e-05 |
| 5,462 | RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes | 2024 | VLDB | 5.494769e-05 |
| 5,660 | Descriptive and Prescriptive Data Cleaning | 2014 | SIGMOD | 5.3847321e-05 |
| 5,684 | Dagger: A Data (not code) Debugger | 2020 | CIDR | 5.3720749e-05 |
| 5,729 | KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing | 2015 | VLDB | 5.3506368e-05 |
| 5,790 | AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data | 2015 | VLDB | 5.3269734e-05 |
| 5,937 | DataXFormer: Leveraging the Web for Semantic Transformations | 2015 | CIDR | 5.2650964e-05 |
| 6,350 | NADEEF: A Generalized Data Cleaning System | 2013 | VLDB | 5.101815e-05 |
| 6,610 | bdbms – A Database Management System for Biological Data | 2007 | CIDR | 4.995269e-05 |
| 8,000 | Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics | 2019 | VLDB | 4.6092803e-05 |
| 8,629 | Spatial Queries with Two kNN Predicates | 2012 | VLDB | 4.4809879e-05 |
| 8,931 | Preserving Privacy and Fairness in Peer-to-Peer Data Integration | 2010 | SIGMOD | 4.427232e-05 |
| 9,306 | Debugging Large-Scale Data Science Pipelines using Dagger | 2020 | VLDB | 4.3572942e-05 |
| 9,577 | CoClean: Collaborative Data Cleaning | 2020 | SIGMOD | 4.3248438e-05 |
| 9,810 | Rheem: Enabling Multi-Platform Task Execution | 2016 | SIGMOD | 4.278405e-05 |
| 9,829 | Sevi: Speech-to-Visualization through Neural Machine Translation | 2022 | SIGMOD | 4.2751057e-05 |
| 11,752 | Lusail: A System for Querying Linked Data at Scale | 2018 | VLDB | 4.1945683e-05 |
| 11,779 | A Demonstration of Lusail - Querying Linked Data at Scale | 2017 | SIGMOD | 4.1945683e-05 |
| 11,943 | A Demonstration of AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data | 2015 | VLDB | 4.1945683e-05 |
| 12,842 | A Top-Down Approach For Two Level Serializability | 1994 | VLDB | 4.1945683e-05 |
| 13,340 | Errata for "Lightning Fast and Space Efficient Inequality Joins" (PVLDB 8(13): 2074-2085) | 2017 | VLDB | - |
| 13,490 | U-MAP: A System for Usage-Based Schema Matching and Mapping | 2011 | SIGMOD | - |
| 13,850 | Ontology-based Support for Digital Government | 2001 | VLDB | - |
| 13,942 | World Wide Database - Integrating the Web, CORBA and Databases | 1999 | SIGMOD | - |