| 1,482 |
Automating Large-Scale Data Quality Verification |
2018 |
VLDB |
0.00011725533 |
| 1,894 |
Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning |
2020 |
VLDB |
0.0001018378 |
| 2,077 |
Efficient Discovery of Approximate Dependencies |
2018 |
VLDB |
9.6001836e-05 |
| 2,253 |
Efficient Denial Constraint Discovery with Hydra |
2018 |
VLDB |
9.1937209e-05 |
| 2,483 |
Discovery of Approximate (and Exact) Denial Constraints |
2020 |
VLDB |
8.6864916e-05 |
| 2,650 |
Detecting Logic Bugs of Join Optimizations in DBMS |
2023 |
SIGMOD |
8.3708191e-05 |
| 2,968 |
Raha: A Configuration-Free Error Detection System |
2019 |
SIGMOD |
7.7985097e-05 |
| 2,982 |
FastQRE: Fast Query Reverse Engineering |
2018 |
SIGMOD |
7.7801984e-05 |
| 3,467 |
Data Profiling – A Tutorial |
2017 |
SIGMOD |
7.069081e-05 |
| 3,797 |
Stitching Web Tables for Improving Matching Quality |
2017 |
VLDB |
6.7597149e-05 |
| 4,127 |
A Statistical Perspective on Discovering Functional Dependencies in Noisy Data |
2020 |
SIGMOD |
6.4310458e-05 |
| 5,383 |
Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search |
2021 |
VLDB |
5.5393038e-05 |
| 5,613 |
Distributed implementations of dependency discovery algorithms |
2019 |
VLDB |
5.4102298e-05 |
| 6,092 |
Observatory: Characterizing Embeddings of Relational Tables |
2024 |
VLDB |
5.2138566e-05 |
| 7,076 |
Mining Approximate Acyclic Schemes from Relations |
2020 |
SIGMOD |
4.8426354e-05 |
| 7,185 |
Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) |
2019 |
VLDB |
4.8066159e-05 |
| 7,287 |
Discovering Association Rules from Big Graphs |
2022 |
VLDB |
4.7762276e-05 |
| 7,366 |
Discovery Algorithms for Embedded Functional Dependencies |
2020 |
SIGMOD |
4.7515248e-05 |
| 7,838 |
Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes |
2021 |
SIGMOD |
4.6377995e-05 |
| 8,085 |
Discovery and Ranking of Embedded Uniqueness Constraints |
2019 |
VLDB |
4.5902231e-05 |
| 8,472 |
Rapidash: Efficient Detection of Constraint Violations |
2024 |
VLDB |
4.5036378e-05 |
| 8,475 |
DataProf: Semantic Profiling for Iterative Data Cleansing and Business Rule Acquisition |
2018 |
SIGMOD |
4.5028904e-05 |
| 8,703 |
Workload-driven, Lazy Discovery of Data Dependencies for Query Optimization |
2022 |
CIDR |
4.4647237e-05 |
| 8,836 |
Fast Approximate Denial Constraint Discovery |
2023 |
VLDB |
4.4393184e-05 |
| 8,850 |
Hitting Set Enumeration with Partial Information for Unique Column Combination Discovery |
2020 |
VLDB |
4.4364648e-05 |
| 9,355 |
Discovering Top-k Rules using Subjective and Objective Criteria |
2023 |
SIGMOD |
4.3514328e-05 |
| 9,410 |
Leveraging Application Data Constraints to Optimize Database-Backed Web Applications |
2023 |
VLDB |
4.3441378e-05 |
| 9,487 |
Making It Tractable to Catch Duplicates and Conflicts in Graphs |
2023 |
SIGMOD |
4.3341665e-05 |
| 9,646 |
Discovering Functional Dependencies through Hitting Set Enumeration |
2024 |
SIGMOD |
4.3109001e-05 |
| 9,649 |
DAFDiscover: Robust Mining Algorithm for Dynamic Approximate Functional Dependencies on Dirty Data |
2024 |
VLDB |
4.3109001e-05 |
| 9,749 |
Efficient Differential Dependency Discovery |
2024 |
VLDB |
4.2897489e-05 |
| 9,847 |
Discovering Top-k Relevant and Diversified Rules |
2024 |
SIGMOD |
4.2721228e-05 |
| 9,963 |
Parallel Rule Discovery from Large Datasets by Sampling |
2022 |
SIGMOD |
4.2294678e-05 |
| 10,029 |
Outliers: The Good, the Bad and the Ugly |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,489 |
Incremental Rule Discovery in Response to Parameter Updates |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,508 |
Synthesizing Third Normal Form Schemata that Minimize Integrity Maintenance and Update Overheads: Parameterizing 3NF by the Numbers of Minimal Keys and Functional Dependencies |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,587 |
Efficient Discovery of Relaxed Functional Dependencies |
2025 |
VLDB |
4.1945683e-05 |
| 10,679 |
How and Why False Denial Constraints are Discovered |
2025 |
VLDB |
4.1945683e-05 |
| 10,791 |
FDepHunter: Harnessing Negative Examples to Expose Fakes and Reveal Ghosts |
2025 |
VLDB |
4.1945683e-05 |
| 11,001 |
Capturing More Associations by Referencing External Graphs |
2024 |
VLDB |
4.1945683e-05 |
| 11,010 |
Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates |
2024 |
VLDB |
4.1945683e-05 |
| 11,024 |
SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis |
2024 |
VLDB |
4.1945683e-05 |
| 11,173 |
Composite Object Normal Forms: Parameterizing Boyce-Codd Normal Form by the Number of Minimal Keys |
2023 |
SIGMOD |
4.1945683e-05 |
| 11,366 |
Statistical Schema Learning using Occam's Razor |
2022 |
SIGMOD |
4.1945683e-05 |
| 11,490 |
Logical Schema Design that Quantifies Update Inefficiency and Join Efficiency |
2021 |
SIGMOD |
4.1945683e-05 |
| 11,546 |
Making DBMSes Dependency-Aware |
2020 |
CIDR |
4.1945683e-05 |