Synthesizing Type-Detection Logic for Rich Semantic Data Types using Open-source Code
Summary: Autotype synthesizes semantic-type detectors by mining code from positive examples and a keyword; it derives type-detection logic from execution traces. On a 112-type benchmark, it detects 84 with precision, boosting web-table discovery vs. baselines. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,158 | Uni-Detect: A Unified Approach to Automated Error Detection in Tables | 2019 | SIGMOD | 9.4141354e-05 |
| 2,587 | Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks | 2024 | SIGMOD | 8.4924618e-05 |
| 2,888 | Sato: Contextual Semantic Type Detection in Tables | 2020 | VLDB | 7.9594996e-05 |
| 3,478 | Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations | 2018 | VLDB | 7.054159e-05 |
| 5,096 | Auto-Transform: Learning-to-Transform by Patterns | 2020 | VLDB | 5.7011825e-05 |
| 5,242 | Towards Benchmarking Feature Type Inference for AutoML Platforms | 2021 | SIGMOD | 5.6074743e-05 |
| 7,838 | Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes | 2021 | SIGMOD | 4.6377995e-05 |
| 10,512 | Auto-Test: Learning Semantic-Domain Constraints for Unsupervised Error Detection in Tables | 2025 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,163 | Extracting Schema from Semistructured Data | 1998 | SIGMOD | 0.00013577466 |
| 8,913 | Making Table Understanding Work in Practice | 2022 | CIDR | 4.427232e-05 |
| 11,205 | Steered Training Data Generation for Learned Semantic Type Detection | 2023 | SIGMOD | 4.1945683e-05 |
| 3,230 | Learning Semantic String Transformations from Examples | 2012 | VLDB | 7.339123e-05 |
| 1,897 | Type Inference for Queries on Semistructured Data (Extended Abstract) | 1999 | PODS | 0.00010178006 |
| 10,512 | Auto-Test: Learning Semantic-Domain Constraints for Unsupervised Error Detection in Tables | 2025 | SIGMOD | 4.1945683e-05 |
| 2,506 | Auto-Detect: Data-Driven Error Detection in Tables | 2018 | SIGMOD | 8.6335464e-05 |
| 5,242 | Towards Benchmarking Feature Type Inference for AutoML Platforms | 2021 | SIGMOD | 5.6074743e-05 |
| 2,888 | Sato: Contextual Semantic Type Detection in Tables | 2020 | VLDB | 7.9594996e-05 |
| 5,099 | ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | 2024 | VLDB | 5.6997784e-05 |