Valentine in Action: Matching Tabular Data at Scale
Summary: Valentine is an open-source suite for scalable schema matching over tabular data. Its holistic matching ingests heterogeneous sources and outputs column similarities, and a GUI lets users fabricate datasets and evaluate matching methods for scalable dataset discovery in data lakes. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 382 | COMA - A system for flexible combination of schema matching approaches | 2002 | VLDB | 0.00024823252 |
| 1,178 | Table Union Search on Open Data | 2018 | VLDB | 0.00013468118 |
| 1,762 | Tuning Schema Matching Software using Synthetic Scenarios | 2005 | VLDB | 0.00010646894 |
| 2,788 | Incremental Schema Matching | 2006 | VLDB | 0.000081251255 |
| 4,486 | OpenII: An Open Source Information Integration Toolkit | 2010 | SIGMOD | 0.000061455674 |
| 5,646 | XBenchMatch: a Benchmark for XML Schema Matching Tools | 2007 | VLDB | 0.000053919887 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,739 | CloudMatcher: A Hands-Off Cloud/Crowd Service for Entity Matching | 2018 | VLDB | 0.000041945683 |
| 4,537 | Privacy Preserving Schema and Data Matching | 2007 | SIGMOD | 0.000061042536 |
| 8,823 | The Role of Schema Matching in Large Enterprises | 2009 | CIDR | 0.000044415658 |
| 8,116 | LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes | 2024 | VLDB | 0.00004581507 |
| 5,081 | Reducing Uncertainty of Schema Matching via Crowdsourcing | 2013 | VLDB | 0.000057132042 |
| 8,917 | Data Lakes Empowered by Knowledge Graph Technologies | 2021 | SIGMOD | 0.00004427232 |
| 8,824 | Analyzing and Revising Data Integration Schemas to Improve Their Matchability | 2008 | VLDB | 0.000044415658 |
| 916 | On Schema Matching with Opaque Column Names and Data Values | 2003 | SIGMOD | 0.00015379422 |
| 4,416 | CrowdMatcher: Crowd-Assisted Schema Matching | 2014 | SIGMOD | 0.000062039225 |
| 303 | Generic Schema Matching with Cupid | 2001 | VLDB | 0.00028301477 |