The Data Civilizer System
Summary: DATA CIVILIZER builds a linkage graph to discover relevant enterprise datasets and join paths, federates execution across heterogeneous stores via a polystore, and integrates cleaning into query processing. It also adds a workflow engine for flexible, update-aware pipelines. (summarized by gpt-5-mini on Feb 09 2026)
Authors
1. Dong Deng
2. Raul Castro Fernandez
3. Ziawasch Abedjan
4. Sibo Wang
5. Michael Stonebraker
6. Ahmed Elmagarmid
7. Ihab F. Ilyas
8. Samuel Madden
9. Mourad Ouzzani
10. Nan Tang
Incoming Citations (Sorted by Pagerank)
Showing 4 of 54 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,069 | Hardware-Efficient Data Imputation through DBMS Extensibility | 2024 | VLDB | 4.1945683e-05 |
| 11,150 | Zed: Leveraging Data Types to Process Eclectic Data | 2023 | CIDR | 4.1945683e-05 |
| 11,343 | SPINE: Scaling up Programming-by-Negative-Example for String Filtering and Transformation | 2022 | SIGMOD | 4.1945683e-05 |
| 11,547 | CAFE: Constraint-Aware Feature Extraction from Large Databases | 2020 | CIDR | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 199 | Declarative Data Cleaning: Language, Model, and Algorithms | 2001 | VLDB | 0.00035041015 |
| 13,238 | DataMingler: A Novel Approach to Data Virtualization | 2021 | SIGMOD | - |
| 12,321 | Linkage Query Writer | 2009 | VLDB | 4.1945683e-05 |
| 2,367 | Here are my Data Files. Here are my Queries. Where are my Results? | 2011 | CIDR | 8.9511058e-05 |
| 11,874 | Graph-based Exploration of Non-graph Datasets | 2016 | VLDB | 4.1945683e-05 |
| 11,319 | Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration | 2022 | CIDR | 4.1945683e-05 |
| 489 | Data Curation at Scale: The Data Tamer System | 2013 | CIDR | 0.00022030728 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 8,000 | Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics | 2019 | VLDB | 4.6092803e-05 |
| 5,058 | A Demo of the Data Civilizer System | 2017 | SIGMOD | 5.7280139e-05 |