Rock: Cleaning Data by Embedding ML in Logic Rules
Summary: Rock unifies ML and logic by embedding classifiers as predicates in rules for entity resolution, conflict resolution, timeliness, and imputation. Batch/incremental rule learning, error detection, and corrections from rules and ground truth. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xianchun Bao
- 2. Zian Bao
- 3. Binbin Bie
- 4. QingSong Duan
- 5. Wenfei Fan
- 6. Hui Lei
- 7. Daji Li
- 8. Wei Lin
- 9. Peng Liu
- 10. Zhicong Lv
- 11. Mingliang Ouyang
- 12. Shuai Tang
- 13. Yaoshu Wang
- 14. Qiyuan Wei
- 15. Min Xie
- 16. Jing Zhang
- 17. Xin Zhang
- 18. Runxiao Zhao
- 19. Shuping Zhou
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,716 | nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems | 2024 | VLDB | 4.4618187e-05 |
| 9,846 | HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs | 2025 | VLDB | 4.2721228e-05 |
| 9,847 | Discovering Top-k Relevant and Diversified Rules | 2024 | SIGMOD | 4.2721228e-05 |
| 10,029 | Outliers: The Good, the Bad and the Ugly | 2026 | SIGMOD | 4.1945683e-05 |
| 10,478 | Data Enhancement for Binary Classification of Relational Data | 2025 | SIGMOD | 4.1945683e-05 |
| 10,489 | Incremental Rule Discovery in Response to Parameter Updates | 2025 | SIGMOD | 4.1945683e-05 |
| 11,111 | Rock: Cleaning Data with both ML and Logic Rules | 2024 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 44 of 44 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 1,627 | Data Cleaning: Overview and Emerging Challenges | 2016 | SIGMOD | 0.00011086905 |
| 4,904 | Temporal Rules Discovery for Web Data Cleaning | 2016 | VLDB | 5.8399195e-05 |
| 10,022 | In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration | 2026 | SIGMOD | 4.1945683e-05 |
| 11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05 |
| 3,192 | Towards Dependable Data Repairing with Fixing Rules | 2014 | SIGMOD | 7.4095761e-05 |
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 7,867 | Learning Over Dirty Data Without Cleaning | 2020 | SIGMOD | 4.6320452e-05 |
| 9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05 |
| 11,111 | Rock: Cleaning Data with both ML and Logic Rules | 2024 | VLDB | 4.1945683e-05 |