| 489 | Data Curation at Scale: The Data Tamer System | 2013 | CIDR | 0.00022030728 |
| 555 | Discovering Denial Constraints | 2013 | VLDB | 0.00020254908 |
| 754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211 |
| 833 | Guided Data Repair | 2011 | VLDB | 0.00016138432 |
| 881 | Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes | 2013 | SIGMOD | 0.00015661103 |
| 1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733 |
| 1,092 | E-Store: Fine-Grained Elastic Partitioning for Distributed Transaction Processing Systems | 2015 | VLDB | 0.00014135961 |
| 1,197 | The LLUNATIC Data-Cleaning Framework | 2013 | VLDB | 0.00013390321 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,426 | LiveGraph: A Transactional Graph Storage System with Purely Sequential Adjacency List Scans | 2020 | VLDB | 0.00012050977 |
| 1,501 | P-Store: An Elastic Database System with Predictive Provisioning | 2018 | SIGMOD | 0.00011664869 |
| 1,541 | Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes | 2023 | CIDR | 0.00011456579 |
| 1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851 |
| 1,612 | Detecting Data Errors: Where are we and what needs to be done? | 2016 | VLDB | 0.00011142794 |
| 1,831 | Synthesizing Entity Matching Rules by Examples | 2018 | VLDB | 0.00010384082 |
| 1,892 | Querying Shortest Paths on Time Dependent Road Networks | 2019 | VLDB | 0.00010185573 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 0.00010109102 |
| 2,188 | Effective Indexing for Approximate Constrained Shortest Path Queries on Large Road Networks | 2017 | VLDB | 9.3372315e-05 |
| 2,349 | RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation | 2021 | VLDB | 8.9876423e-05 |
| 2,364 | Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries | 2020 | SIGMOD | 8.9554751e-05 |
| 2,574 | Discovery of Genuine Functional Dependencies from Relational Data with Missing Values | 2018 | VLDB | 8.5173637e-05 |
| 2,607 | Graph Stream Summarization: From Big Bang to Big Crunch | 2016 | SIGMOD | 8.4630211e-05 |
| 2,617 | Extraction and Integration of Partially Overlapping Web Sources | 2013 | VLDB | 8.4462621e-05 |
| 2,638 | Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms | 2016 | VLDB | 8.399764e-05 |
| 2,852 | MRI: Meaningful Interpretations of Collaborative Ratings | 2011 | VLDB | 8.0151391e-05 |
| 2,945 | Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning | 2023 | SIGMOD | 7.8377395e-05 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 2,948 | Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases | 2015 | SIGMOD | 7.8341023e-05 |
| 2,968 | Raha: A Configuration-Free Error Detection System | 2019 | SIGMOD | 7.7985097e-05 |
| 3,005 | Clay: Fine-Grained Adaptive Partitioning for General Database Schemas | 2017 | VLDB | 7.7303579e-05 |
| 3,130 | Behavior Based Record Linkage | 2010 | VLDB | 7.4993061e-05 |
| 3,140 | ZeroER: Entity Resolution using Zero Labeled Examples | 2020 | SIGMOD | 7.4841763e-05 |
| 3,192 | Towards Dependable Data Repairing with Fixing Rules | 2014 | SIGMOD | 7.4095761e-05 |
| 3,197 | A Probabilistic Optimization Framework for the Empty-Answer Problem | 2013 | VLDB | 7.3955829e-05 |
| 3,265 | RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! - | 2018 | VLDB | 7.3083672e-05 |
| 3,403 | Piggybacking on Social Networks | 2013 | VLDB | 7.136763e-05 |
| 3,449 | Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation | 2022 | VLDB | 7.0824319e-05 |
| 3,571 | Lightning Fast and Space Efficient Inequality Joins | 2015 | VLDB | 6.9580858e-05 |
| 3,582 | NADEEF/ER: Generic and Interactive Entity Resolution | 2014 | SIGMOD | 6.9479263e-05 |
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 3,675 | Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions | 2014 | VLDB | 6.8555664e-05 |
| 3,753 | Choosing A Cloud DBMS: Architectures and Tradeoffs | 2019 | VLDB | 6.7871241e-05 |
| 3,861 | Generating Concise Entity Matching Rules | 2017 | SIGMOD | 6.6878164e-05 |
| 3,881 | Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype? | 2015 | SIGMOD | 6.6691196e-05 |
| 3,976 | UGuide – User-Guided Discovery of FD-Detectable Errors | 2017 | SIGMOD | 6.5736462e-05 |
| 4,102 | GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data | 2023 | SIGMOD | 6.4522929e-05 |
| 4,212 | Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration | 2023 | SIGMOD | 6.3555142e-05 |
| 4,359 | Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning | 2021 | VLDB | 6.2569955e-05 |
| 4,650 | LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data | 2016 | VLDB | 6.0234336e-05 |
| 4,667 | FlexPushdownDB: Hybrid Pushdown and Caching in a Cloud DBMS | 2021 | VLDB | 6.0116919e-05 |