Learning Semantic String Transformations from Examples
Summary: Semantic string transformations exploit data-type semantics via table lookups, not regex, to map dates, currencies, and mixed values. Programming-by-example synthesis learns all such transformations in an expressive language blending table lookups with syntactic ops, implemented as an Excel add-in and evaluated on benchmarks. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Rishabh Singh
- 2. Sumit Gulwani
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,267 | Foofah: Transforming Data By Example | 2017 | SIGMOD | 0.00012936483 |
| 1,469 | BlinkFill: Semi-supervised Programming By Example for Syntactic String Transformations | 2016 | VLDB | 0.00011836053 |
| 3,192 | Towards Dependable Data Repairing with Fixing Rules | 2014 | SIGMOD | 7.4095761e-05 |
| 5,280 | Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V | 2023 | VLDB | 5.5896735e-05 |
| 6,818 | NLyze: Interactive Programming by Natural Language for SpreadSheet Data Analysis and Manipulation | 2014 | SIGMOD | 4.916347e-05 |
| 7,766 | ICARUS: Minimizing Human Effort in Iterative Data Completion | 2018 | VLDB | 4.6564959e-05 |
| 7,812 | Foofah: A Programming-By-Example System for Synthesizing Data Transformation Programs | 2017 | SIGMOD | 4.6443197e-05 |
| 11,087 | Dealing with Acronyms, Abbreviations, and Typos in Real-World Entity Matching | 2024 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |
| 492 | Query by Output | 2009 | SIGMOD | 0.00021974699 |
| 3,328 | Multi-column Substring Matching for Database Schema Translation | 2006 | VLDB | 7.2174278e-05 |
| 3,451 | Learning String Transformations From Examples | 2009 | VLDB | 7.0822216e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,478 | Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations | 2018 | VLDB | 7.054159e-05 |
| 11,343 | SPINE: Scaling up Programming-by-Negative-Example for String Filtering and Transformation | 2022 | SIGMOD | 4.1945683e-05 |
| 11,888 | Synthesizing Data Programs | 2015 | CIDR | 4.1945683e-05 |
| 3,451 | Learning String Transformations From Examples | 2009 | VLDB | 7.0822216e-05 |
| 5,275 | Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples | 2023 | VLDB | 5.5905507e-05 |
| 3,735 | Auto-Join: Joining Tables by Leveraging Transformations | 2017 | VLDB | 6.8061318e-05 |
| 9,389 | DataVinci: Learning Syntactic and Semantic String Repairs | 2025 | SIGMOD | 4.3441378e-05 |
| 4,695 | DataXFormer: An Interactive Data Transformation Tool | 2015 | SIGMOD | 5.9927993e-05 |
| 5,937 | DataXFormer: Leveraging the Web for Semantic Transformations | 2015 | CIDR | 5.2650964e-05 |
| 5,096 | Auto-Transform: Learning-to-Transform by Patterns | 2020 | VLDB | 5.7011825e-05 |