Global Detection of Complex Copying Relationships Between Sources
Summary: Global detection algorithm uncovers direct copying by jointly handling co-copying and transitive relationships across sources. Direction inference via multi-evidence signals and cross-item correlation; scalable and accurate on real and synthetic data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xin Luna Dong
- 2. Laure Berti-Equille
- 3. Yifan Hu
- 4. Divesh Srivastava
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |
| 705 | Winnowing: Local Algorithms for Document Fingerprinting | 2003 | SIGMOD | 0.00017864657 |
| 855 | Integrating Conflicting Data: The Role of Source Dependence | 2009 | VLDB | 0.00015906735 |
| 1,246 | Truth Discovery and Copying Detection in a Dynamic World | 2009 | VLDB | 0.0001307161 |
| 7,229 | Sailing the Information Ocean with Awareness of Currents: Discovery and Application of Source Dependence | 2009 | CIDR | 4.7950172e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,549 | SOLOMON: Seeking the Truth Via Copying Detection | 2010 | VLDB | 4.7137426e-05 |
| 7,345 | Linking Temporal Records for Profiling Entities | 2015 | SIGMOD | 4.756212e-05 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746 |
| 229 | Reference Reconciliation in Complex Information Spaces | 2005 | SIGMOD | 0.00032242633 |
| 2,617 | Extraction and Integration of Partially Overlapping Web Sources | 2013 | VLDB | 8.4462621e-05 |
| 616 | Copy Detection Mechanisms for Digital Documents | 1995 | SIGMOD | 0.00019108201 |
| 855 | Integrating Conflicting Data: The Role of Source Dependence | 2009 | VLDB | 0.00015906735 |
| 908 | Fusing Data with Correlations | 2014 | SIGMOD | 0.00015431241 |
| 1,246 | Truth Discovery and Copying Detection in a Dynamic World | 2009 | VLDB | 0.0001307161 |
| 12,178 | Large-Scale Copy Detection | 2011 | SIGMOD | 4.1945683e-05 |