| 192 |
HoloClean: Holistic Data Repairs with Probabilistic Inference |
2017 |
VLDB |
0.00035728858 |
| 224 |
CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies |
2004 |
SIGMOD |
0.00032746205 |
| 489 |
Data Curation at Scale: The Data Tamer System |
2013 |
CIDR |
0.00022030728 |
| 555 |
Discovering Denial Constraints |
2013 |
VLDB |
0.00020254908 |
| 674 |
Supporting Top-k Join Queries in Relational Databases |
2003 |
VLDB |
0.00018327585 |
| 833 |
Guided Data Repair |
2011 |
VLDB |
0.00016138432 |
| 1,012 |
NADEEF: A Commodity Data Cleaning System |
2013 |
SIGMOD |
0.0001464733 |
| 1,262 |
RankSQL: Query Algebra and Optimization for Relational Top-k Queries |
2005 |
SIGMOD |
0.00012986539 |
| 1,277 |
The Data Civilizer System |
2017 |
CIDR |
0.00012879695 |
| 1,337 |
HoloDetect: Few-Shot Learning for Error Detection |
2019 |
SIGMOD |
0.00012497164 |
| 1,542 |
Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases |
2008 |
VLDB |
0.00011456321 |
| 1,546 |
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing |
2015 |
SIGMOD |
0.00011446851 |
| 1,612 |
Detecting Data Errors: Where are we and what needs to be done? |
2016 |
VLDB |
0.00011142794 |
| 1,624 |
Sampling the Repairs of Functional Dependency Violations under Hard Constraints |
2010 |
VLDB |
0.00011099222 |
| 1,627 |
Data Cleaning: Overview and Emerging Challenges |
2016 |
SIGMOD |
0.00011086905 |
| 2,319 |
Expressive and Flexible Access to Web-Extracted Data: A Keyword-based Structured Query Language |
2010 |
SIGMOD |
9.0387108e-05 |
| 2,320 |
High-Throughput Vector Similarity Search in Knowledge Graphs |
2023 |
SIGMOD |
9.0366225e-05 |
| 2,393 |
Rank-aware Query Optimization |
2004 |
SIGMOD |
8.9016542e-05 |
| 2,883 |
Joining Ranked Inputs in Practice |
2002 |
VLDB |
7.9656673e-05 |
| 2,946 |
BigDansing: A System for Big Data Cleansing |
2015 |
SIGMOD |
7.8372441e-05 |
| 3,014 |
Ranking with Uncertain Scoring Functions: Semantics and Sensitivity Measures |
2011 |
SIGMOD |
7.70946e-05 |
| 3,360 |
Modeling and Querying Possible Repairs in Duplicate Detection |
2009 |
VLDB |
7.1742067e-05 |
| 3,440 |
Approximate Denial Constraints |
2020 |
VLDB |
7.0918817e-05 |
| 3,528 |
Distributed Data Deduplication |
2016 |
VLDB |
7.0066139e-05 |
| 3,582 |
NADEEF/ER: Generic and Interactive Entity Resolution |
2014 |
SIGMOD |
6.9479263e-05 |
| 3,711 |
Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale |
2022 |
SIGMOD |
6.823609e-05 |
| 3,807 |
Supporting Ad-hoc Ranking Aggregates |
2006 |
SIGMOD |
6.747576e-05 |
| 3,831 |
Kamino: Constraint-Aware Differentially Private Data Synthesis |
2021 |
VLDB |
6.7181688e-05 |
| 3,942 |
Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins |
2022 |
VLDB |
6.6114622e-05 |
| 4,397 |
Estimating Compilation Time of a Query Optimizer |
2003 |
SIGMOD |
6.2230918e-05 |
| 4,559 |
Creating Competitive Products |
2009 |
VLDB |
6.0857166e-05 |
| 4,695 |
DataXFormer: An Interactive Data Transformation Tool |
2015 |
SIGMOD |
5.9927993e-05 |
| 4,801 |
CLAMS: Bringing Quality to Data Lakes |
2016 |
SIGMOD |
5.9115269e-05 |
| 5,058 |
A Demo of the Data Civilizer System |
2017 |
SIGMOD |
5.7280139e-05 |
| 5,538 |
Growing and Serving Large Open-domain Knowledge Graphs |
2023 |
SIGMOD |
5.4509524e-05 |
| 5,613 |
Distributed implementations of dependency discovery algorithms |
2019 |
VLDB |
5.4102298e-05 |
| 5,660 |
Descriptive and Prescriptive Data Cleaning |
2014 |
SIGMOD |
5.3847321e-05 |
| 5,729 |
KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing |
2015 |
VLDB |
5.3506368e-05 |
| 5,758 |
Top-k Nearest Neighbor Search In Uncertain Data Series |
2015 |
VLDB |
5.339397e-05 |
| 5,815 |
StatAdvisor: Recommending Statistical Views |
2009 |
VLDB |
5.3165295e-05 |
| 5,937 |
DataXFormer: Leveraging the Web for Semantic Transformations |
2015 |
CIDR |
5.2650964e-05 |
| 6,065 |
APEx: Accuracy-Aware Differentially Private Data Exploration |
2019 |
SIGMOD |
5.2291685e-05 |
| 6,350 |
NADEEF: A Generalized Data Cleaning System |
2013 |
VLDB |
5.101815e-05 |
| 6,546 |
Properties of Inconsistency Measures for Databases |
2021 |
SIGMOD |
5.0185588e-05 |
| 6,882 |
RankSQL: Supporting Ranking Queries in Relational Database Management Systems |
2005 |
VLDB |
4.8963901e-05 |
| 7,013 |
Qualitative Data Cleaning |
2016 |
VLDB |
4.8619024e-05 |
| 7,653 |
CORDS: Automatic Generation of Correlation Statistics in DB2 |
2004 |
VLDB |
4.6875371e-05 |
| 8,372 |
URank: Formulation and Efficient Evaluation of Top-k Queries in Uncertain Databases |
2007 |
SIGMOD |
4.532996e-05 |
| 9,310 |
FIX: Feature-based Indexing Technique for XML Documents |
2006 |
VLDB |
4.3570863e-05 |
| 9,697 |
PCOR: Private Contextual Outlier Release via Differentially Private Search |
2021 |
SIGMOD |
4.3022295e-05 |