A Demo of the Data Civilizer System
Summary: Demo of Data Civilizer, an end-to-end big-data management system for enterprise data. Unified pipeline: data discovery, integration/stitching, cleaning, and cross-storage querying to tame dirty, scattered sources. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Raul Castro Fernandez
- 2. Dong Deng
- 3. Essam Mansour
- 4. Abdulhakim A. Qahtan
- 5. Wenbo Tao
- 6. Ziawasch Abedjan
- 7. Ahmed Elmagarmid
- 8. Ihab F. Ilyas
- 9. Samuel Madden
- 10. Mourad Ouzzani
- 11. Michael Stonebraker
- 12. Nan Tang
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,463 | ARDA: Automatic Relational Data Augmentation for Machine Learning | 2020 | VLDB | 0.00011869295 |
| 3,265 | RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! - | 2018 | VLDB | 7.3083672e-05 |
| 3,942 | Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins | 2022 | VLDB | 6.6114622e-05 |
| 6,438 | RONIN: Data Lake Exploration | 2021 | VLDB | 5.0620163e-05 |
| 8,000 | Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics | 2019 | VLDB | 4.6092803e-05 |
| 8,910 | R2D2: Reducing Redundancy and Duplication in Data Lakes | 2023 | SIGMOD | 4.427232e-05 |
| 11,480 | Structural Generalizability: The Case of Similarity Search | 2021 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 518 | Data Integration for the Relational Web | 2009 | VLDB | 0.00021158934 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 9,810 | Rheem: Enabling Multi-Platform Task Execution | 2016 | SIGMOD | 4.278405e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,833 | Data Wrangling: The Challenging Journey from the Wild to the Lake | 2015 | CIDR | 0.00010378976 |
| 3,281 | Constance: An Intelligent Data Lake System | 2016 | SIGMOD | 7.2823287e-05 |
| 10,439 | Finding What You’re Looking For: A Distribution-Aware Dataset Search Engine in Action | 2025 | SIGMOD | 4.1945683e-05 |
| 11,319 | Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration | 2022 | CIDR | 4.1945683e-05 |
| 10,433 | DataDazzle: Intelligent Data Exploration through Natural Language | 2025 | SIGMOD | 4.1945683e-05 |
| 6,384 | A Demonstration of DBWipes: Clean as You Query | 2012 | VLDB | 5.0880333e-05 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 4,426 | Data Debugging and Exploration with Vizier | 2019 | SIGMOD | 6.1969994e-05 |
| 8,000 | Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics | 2019 | VLDB | 4.6092803e-05 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |