Entity Matching Meets Data Science: A Progress Report from the Magellan Project
Summary: Magellan treats entity matching as a data-science ecosystem with interoperable tools (PyMatcher, CloudMatcher) for power and lay users. Over 3.5 years, it reports production deployments across 21 EM tasks in 12 companies and a cloud-native, Docker/Kubernetes ecosystem. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yash Govind
- 2. Pradap Konda
- 3. Paul Suganthan G. C.
- 4. Philip Martinkus
- 5. Palaniappan Nagarajan
- 6. Han Li
- 7. Aravind Soundararajan
- 8. Sidharth Mudgal
- 9. Jeffrey R. Ballard
- 10. Haojun Zhang
- 11. Adel Ardalan
- 12. Sanjib Das
- 13. Derek Paulsen
- 14. Amanpreet Saini
- 15. Erik Paulson
- 16. Youngchoon Park
- 17. Marshall Carter
- 18. Mingju Sun
- 19. Glenn M. Fung
- 20. AnHai Doan
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,703 | Medical Entity Disambiguation Using Graph Neural Networks | 2021 | SIGMOD | 5.9855056e-05 |
| 8,099 | Sparkly: A Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching | 2023 | VLDB | 4.5859317e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 300 | Deep Learning for Entity Matching: A Design Space Exploration | 2018 | SIGMOD | 0.00028441466 |
| 712 | Magellan: Toward Building Entity Matching Management Systems | 2016 | VLDB | 0.00017732426 |
| 2,038 | The return of JedAI: End-to-End Entity Resolution for Structured and Semi-Structured Data | 2018 | VLDB | 9.7098952e-05 |
| 2,175 | Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services | 2017 | SIGMOD | 9.3644117e-05 |
| 2,854 | The Garlic Project | 1996 | SIGMOD | 8.0103732e-05 |
| 4,402 | Smurf: Self-Service String Matching Using Random Forests | 2019 | VLDB | 6.2195162e-05 |
| 11,739 | CloudMatcher: A Hands-Off Cloud/Crowd Service for Entity Matching | 2018 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next