Database Paper Browser

Back to papers

Foofah: Transforming Data By Example

Summary: Data wrangling is automated by example. Foofah synthesizes transformation programs from a small input-output example by efficiently searching the space of operations, outperforming Wrangler with about 60% less user effort. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5410
Venue
SIGMOD
Year
2017
Pagerank
0.00012936483
Overall Rank
1,267 | 91.19%
DOI
10.1145/3035918.3064034

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 26 of 26 citing papers.

Rank Citing Paper Year Venue Pagerank
3,252 Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks 2020 SIGMOD 7.3178277e-05
3,478 Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations 2018 VLDB 7.054159e-05
3,690 Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets 2018 SIGMOD 6.8384476e-05
3,735 Auto-Join: Joining Tables by Leveraging Transformations 2017 VLDB 6.8061318e-05
4,635 Mining Precision Interfaces From Query Logs 2019 SIGMOD 6.033398e-05
5,096 Auto-Transform: Learning-to-Transform by Patterns 2020 VLDB 5.7011825e-05
5,275 Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples 2023 VLDB 5.5905507e-05
5,280 Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V 2023 VLDB 5.5896735e-05
5,383 Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search 2021 VLDB 5.5393038e-05
6,800 DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language Models 2024 SIGMOD 4.9231471e-05
7,458 Trinity: An Extensible Synthesis Framework for Data Science 2019 VLDB 4.7245406e-05
7,463 Automated Migration of Hierarchical Data to Relational Tables using Programming-by-Example 2018 VLDB 4.7232241e-05
7,812 Foofah: A Programming-By-Example System for Synthesizing Data Transformation Programs 2017 SIGMOD 4.6443197e-05
8,042 Transform-Data-by-Example (TDE): Extensible Data Transformation in Excel 2018 SIGMOD 4.5994569e-05
8,088 PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage 2020 VLDB 4.5897316e-05
9,035 Data-Driven Insight Synthesis for Multi-Dimensional Data 2024 VLDB 4.4039656e-05
9,371 Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations 2024 SIGMOD 4.3480692e-05
9,399 TabulaX: Leveraging Large Language Models for Multi-Class Table Transformations 2025 VLDB 4.3441378e-05
9,777 Data Augmentation for ML-driven Data Preparation and Integration 2021 VLDB 4.2856106e-05
9,886 Scalable and Usable Relational Learning With Automatic Language Bias 2021 SIGMOD 4.2621158e-05
10,019 Guardrail: Automated Integrity Constraint Synthesis From Noisy Data 2026 SIGMOD 4.1945683e-05
10,598 Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence 2025 VLDB 4.1945683e-05
10,610 Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation 2025 VLDB 4.1945683e-05
11,297 DataRinse: Semantic Transforms for Data preparation based on Code Mining 2023 VLDB 4.1945683e-05
11,343 SPINE: Scaling up Programming-by-Negative-Example for String Filtering and Transformation 2022 SIGMOD 4.1945683e-05
11,754 Constraint-based Explanation and Repair of Filter-based Transformations 2018 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers