Database Paper Browser

Back to papers

BlinkFill: Semi-supervised Programming By Example for Syntactic String Transformations

Summary: BlinkFill: semi-supervised PBE for syntactic string transformations, using InputDataGraph to capture shared input patterns and guide synthesis. Evaluated on 207 benchmarks, it attains ~41x speedups and uses fewer examples (1.27 vs 1.53) than FlashFill. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11369
Venue
VLDB
Year
2016
Pagerank
0.00011836053
Overall Rank
1,469 | 89.79%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 23 of 23 citing papers.

Rank Citing Paper Year Venue Pagerank
1,267 Foofah: Transforming Data By Example 2017 SIGMOD 0.00012936483
1,894 Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning 2020 VLDB 0.0001018378
2,158 Uni-Detect: A Unified Approach to Automated Error Detection in Tables 2019 SIGMOD 9.4141354e-05
2,968 Raha: A Configuration-Free Error Detection System 2019 SIGMOD 7.7985097e-05
3,252 Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks 2020 SIGMOD 7.3178277e-05
3,478 Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations 2018 VLDB 7.054159e-05
3,735 Auto-Join: Joining Tables by Leveraging Transformations 2017 VLDB 6.8061318e-05
5,096 Auto-Transform: Learning-to-Transform by Patterns 2020 VLDB 5.7011825e-05
5,192 Pattern Functional Dependencies for Data Cleaning 2020 VLDB 5.6375087e-05
5,280 Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V 2023 VLDB 5.5896735e-05
5,383 Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search 2021 VLDB 5.5393038e-05
6,237 New Trends on Exploratory Methods for Data Analytics 2017 VLDB 5.1435341e-05
6,758 Data Migration using Datalog Program Synthesis 2020 VLDB 4.937199e-05
6,800 DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language Models 2024 SIGMOD 4.9231471e-05
6,996 Web Data Extraction using Hybrid Program Synthesis: A Combination of Top-down and Bottom-up Inference 2020 SIGMOD 4.8681362e-05
7,463 Automated Migration of Hierarchical Data to Relational Tables using Programming-by-Example 2018 VLDB 4.7232241e-05
8,042 Transform-Data-by-Example (TDE): Extensible Data Transformation in Excel 2018 SIGMOD 4.5994569e-05
8,344 Exploring the Data Wilderness through Examples 2019 SIGMOD 4.5428111e-05
9,035 Data-Driven Insight Synthesis for Multi-Dimensional Data 2024 VLDB 4.4039656e-05
9,399 TabulaX: Leveraging Large Language Models for Multi-Class Table Transformations 2025 VLDB 4.3441378e-05
10,598 Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence 2025 VLDB 4.1945683e-05
10,610 Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation 2025 VLDB 4.1945683e-05
11,343 SPINE: Scaling up Programming-by-Negative-Example for String Filtering and Transformation 2022 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers