Automatic Data Fusion with HumMer
Summary: HumMer enables ad-hoc, declarative data fusion over heterogeneous, dirty data via a simple SQL extension. It automates instance-based schema matching, deduplication, and data fusion with conflict resolution, guided by a query across multiple tables. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Alexander Bilke
- 2. Jens Bleiholder
- 3. Felix Naumann
- 4. Christoph Böhm
- 5. Melanie Weis
- 6. Karsten Draba
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,559 | FuSem – Exploring Different Semantics of Data Fusion | 2007 | VLDB | 8.5441188e-05 |
| 12,425 | XClean in Action: A Demonstration of Declarative XML Data Cleaning | 2007 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 149 | Trio: A System for Integrated Management of Data, Accuracy, and Lineage | 2005 | CIDR | 0.00041101118 |
| 2,214 | XXL - A Library Approach to Supporting Efficient Implementations of Advanced Database Queries* | 2001 | VLDB | 9.2726469e-05 |
| 2,589 | DogmatiX Tracks down Duplicates in XML | 2005 | SIGMOD | 8.4847146e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,693 | Merging Models Based on Given Correspondences | 2003 | VLDB | 0.00010900382 |
| 5,564 | Interactive Generation of Integrated Schemas | 2008 | SIGMOD | 5.4320854e-05 |
| 9,020 | Entity Matching in the Wild: A Consistent and Versatile Framework to Unify Data in Industrial Applications | 2020 | SIGMOD | 4.4079449e-05 |
| 11,006 | FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data | 2024 | VLDB | 4.1945683e-05 |
| 3,712 | MOMA - A Mapping-based Object Matching System | 2007 | CIDR | 6.823134e-05 |
| 13,686 | Efficient development of data migration transformations | 2004 | SIGMOD | - |
| 280 | Eliminating Fuzzy Duplicates in Data Warehouses | 2002 | VLDB | 0.00029113044 |
| 6,155 | MapMerge: Correlating Independent Schema Mappings | 2010 | VLDB | 5.1802715e-05 |
| 2,452 | Data Fusion – Resolving Data Conflicts for Integration | 2009 | VLDB | 8.7839322e-05 |
| 2,559 | FuSem – Exploring Different Semantics of Data Fusion | 2007 | VLDB | 8.5441188e-05 |