DataProf: Semantic Profiling for Iterative Data Cleansing and Business Rule Acquisition
Summary: DataProf is a semantic data profiler for iterative data cleansing and business rule acquisition. It returns perfect sample records that satisfy the same constraints as the input data, which lets users spot violations quickly and supports interactive, expert-guided discovery of rules, including embedded uniqueness constraints. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Ziheng Wei
2. Sebastian Link
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,818 | Embedded Functional Dependencies and Data-completeness Tailored Database Design | 2019 | VLDB | 6.7300958e-05 |
| 7,766 | ICARUS: Minimizing Human Effort in Iterative Data Completion | 2018 | VLDB | 4.6564959e-05 |
| 8,085 | Discovery and Ranking of Embedded Uniqueness Constraints | 2019 | VLDB | 4.5902231e-05 |
| 8,836 | Fast Approximate Denial Constraint Discovery | 2023 | VLDB | 4.4393184e-05 |
| 9,749 | Efficient Differential Dependency Discovery | 2024 | VLDB | 4.2897489e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 25 | Dependency Inference (Extended Abstract) | 1987 | VLDB | 0.00083101742 |
| 894 | A Hybrid Approach to Functional Dependency Discovery | 2016 | SIGMOD | 0.00015556428 |
| 3,467 | Data Profiling – A Tutorial | 2017 | SIGMOD | 7.069081e-05 |
| 4,499 | Possible and Certain SQL Keys | 2015 | VLDB | 6.1385333e-05 |
| 11,854 | SQL Schema Design: Foundations, Normal Forms, and Normalization | 2016 | SIGMOD | 4.1945683e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,205 | ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies | 2019 | SIGMOD | 5.630869e-05 |
| 7,071 | Smart Drill-Down: A New Data Exploration Operator | 2015 | VLDB | 4.8429461e-05 |
| 1,625 | Data Profiling with Metanome | 2015 | VLDB | 0.00011094926 |
| 3,467 | Data Profiling – A Tutorial | 2017 | SIGMOD | 7.069081e-05 |
| 5,660 | Descriptive and Prescriptive Data Cleaning | 2014 | SIGMOD | 5.3847321e-05 |
| 6,008 | Apollo: A Dataset Profiling and Operator Modeling System | 2019 | SIGMOD | 5.2415551e-05 |
| 7,926 | CoCo: Interactive Exploration of Conformance Constraints for Data Understanding and Data Cleaning | 2021 | SIGMOD | 4.6144554e-05 |
| 4,929 | Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux | 2010 | VLDB | 5.8217296e-05 |
| 732 | Discovering Data Quality Rules | 2008 | VLDB | 0.00017465093 |
| 6,384 | A Demonstration of DBWipes: Clean as You Query | 2012 | VLDB | 5.0880333e-05 |