| 263 |
CrowdER: Crowdsourcing Entity Resolution |
2012 |
VLDB |
0.00029862413 |
| 791 |
ActiveClean: Interactive Data Cleaning For Statistical Modeling |
2016 |
VLDB |
0.00016629664 |
| 866 |
Leveraging Transitive Relations for Crowdsourced Joins |
2013 |
SIGMOD |
0.00015801196 |
| 1,345 |
Entity Matching: How Similar Is Similar |
2011 |
VLDB |
0.00012468408 |
| 1,396 |
Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search |
2012 |
SIGMOD |
0.00012204748 |
| 1,627 |
Data Cleaning: Overview and Emerging Challenges |
2016 |
SIGMOD |
0.00011086905 |
| 1,703 |
Are We Ready For Learned Cardinality Estimation? |
2021 |
VLDB |
0.00010836769 |
| 2,184 |
A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data |
2014 |
SIGMOD |
9.3429789e-05 |
| 2,592 |
Pass-Join: A Partition-based Method for Similarity Joins |
2012 |
VLDB |
8.4795761e-05 |
| 2,753 |
Complaint-driven Training Data Debugging for Query 2.0 |
2020 |
SIGMOD |
8.1724339e-05 |
| 3,192 |
Towards Dependable Data Repairing with Fixing Rules |
2014 |
SIGMOD |
7.4095761e-05 |
| 3,263 |
QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications |
2015 |
SIGMOD |
7.3097573e-05 |
| 3,299 |
SCODED: Statistical Constraint Oriented Data Error Detection |
2020 |
SIGMOD |
7.2546659e-05 |
| 3,737 |
Skipping-oriented Partitioning for Columnar Layouts |
2017 |
VLDB |
6.8033227e-05 |
| 3,773 |
Cleaning Crowdsourced Labels Using Oracles for Statistical Classification |
2019 |
VLDB |
6.7758649e-05 |
| 3,944 |
AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics |
2018 |
SIGMOD |
6.6078243e-05 |
| 4,216 |
Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints |
2010 |
VLDB |
6.3521675e-05 |
| 4,451 |
CLAMShell: Speeding up Crowds for Low-latency Data Labeling |
2016 |
VLDB |
6.1738675e-05 |
| 4,668 |
PrivateClean: Data Cleaning and Differential Privacy |
2016 |
SIGMOD |
6.0115918e-05 |
| 5,222 |
Enabling SQL-based Training Data Debugging for Federated Learning |
2022 |
VLDB |
5.6210545e-05 |
| 5,929 |
ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning |
2016 |
SIGMOD |
5.2682177e-05 |
| 5,981 |
DataPrep.EDA: Task-Centric Exploratory Data Analysis for Statistical Modeling in Python |
2021 |
SIGMOD |
5.2448986e-05 |
| 6,541 |
ConnectorX: Accelerating Data Loading From Databases to Dataframes |
2022 |
VLDB |
5.0216945e-05 |
| 6,779 |
Explaining Inference Queries with Bayesian Optimization |
2021 |
VLDB |
4.9280116e-05 |
| 6,855 |
DBease: Making Databases User-friendly and Easily Accessible |
2011 |
CIDR |
4.9062505e-05 |
| 7,117 |
Crowdsourced Data Management: Overview and Challenges |
2017 |
SIGMOD |
4.826509e-05 |
| 8,593 |
Wisteria: Nurturing Scalable Data Cleaning Infrastructure |
2015 |
VLDB |
4.4891474e-05 |
| 8,643 |
One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees |
2022 |
SIGMOD |
4.4777916e-05 |
| 8,678 |
Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment |
2019 |
SIGMOD |
4.4702119e-05 |
| 8,728 |
Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views |
2015 |
VLDB |
4.4589711e-05 |
| 8,853 |
Complaint-Driven Training Data Debugging at Interactive Speeds |
2022 |
SIGMOD |
4.4350727e-05 |
| 9,273 |
ActiveDeeper: A Model-based Active Data Enrichment System |
2020 |
VLDB |
4.3649603e-05 |
| 10,115 |
ST-Raptor: LLM-Powered Semi-Structured Table Question Answering |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,582 |
A Flexible Framework for Query-oriented Interactive Community Search |
2025 |
VLDB |
4.1945683e-05 |
| 10,591 |
Accio: Bolt-on Query Federation |
2025 |
VLDB |
4.1945683e-05 |
| 10,762 |
ParSEval: Plan-aware Test Database Generation for SQL Equivalence Evaluation |
2025 |
VLDB |
4.1945683e-05 |
| 11,722 |
Deeper: A Data Enrichment System Powered by Deep Web |
2018 |
SIGMOD |
4.1945683e-05 |
| 13,194 |
Web Connector: A Unified API Wrapper to Simplify Web Data Collection |
2023 |
VLDB |
- |