ALIAS*: An Active Learning led Interactive Deduplication System
Summary: Active-learning guides interactive deduplication, needing only a few domain-specific attribute similarities and a small set of labeled record pairs. A cluster-based execution model enables scalable application of the learned deduplication function to large datasets. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Sunita Sarawagi
- 2. Anuradha Bhamidipaty
- 3. Alok Kirpal
- 4. Chandra Mouli
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,713 | GDR: A System for Guided Data Repair | 2010 | SIGMOD | 6.8224341e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,439 | (Almost) Hands-Off Information Integration for the Life Sciences | 2005 | CIDR | 5.0612141e-05 |
| 936 | Framework for Evaluating Clustering Algorithms in Duplicate Detection | 2009 | VLDB | 0.0001521549 |
| 4,758 | Optimization for Active Learning-based Interactive Database Exploration | 2019 | VLDB | 5.9422515e-05 |
| 280 | Eliminating Fuzzy Duplicates in Data Warehouses | 2002 | VLDB | 0.00029113044 |
| 8,908 | Deep Active Alignment of Knowledge Graph Entities and Schemata | 2023 | SIGMOD | 4.427232e-05 |
| 3,360 | Modeling and Querying Possible Repairs in Duplicate Detection | 2009 | VLDB | 7.1742067e-05 |
| 6,042 | MDedup: Duplicate Detection with Matching Dependencies | 2020 | VLDB | 5.2405269e-05 |
| 2,386 | Leveraging Aggregate Constraints For Deduplication | 2007 | SIGMOD | 8.9231895e-05 |
| 4,619 | Crowd-Based Deduplication: An Adaptive Approach | 2015 | SIGMOD | 6.0444854e-05 |
| 5,282 | Deep Indexed Active Learning for Matching Heterogeneous Entity Representations | 2022 | VLDB | 5.5864206e-05 |