An Integrated Development Environment for Faster Feature Engineering
Summary: An IDE for accelerating feature engineering via data-centric tooling. Uses an index and runtime planner to rank raw data objects by relevance to feature code, enabling rapid evaluation of feature impact and showing speedups over baselines. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Michael R. Anderson
- 2. Michael Cafarella
- 3. Yixing Jiang
- 4. Guan Wang
- 5. Bochun Zhang
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,347 | A Relational Framework for Classifier Engineering | 2017 | PODS | 5.1019568e-05 |
| 11,476 | Enforcing Constraints for Machine Learning Systems via Declarative Feature Selection: An Experimental Study | 2021 | SIGMOD | 4.1945683e-05 |
| 13,360 | Faster Evaluation of Labor-Intensive Features | 2015 | CIDR | - |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,915 | Brainwash: A Data System for Feature Engineering | 2013 | CIDR | 7.9078385e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,347 | A Relational Framework for Classifier Engineering | 2017 | PODS | 5.1019568e-05 |
| 8,208 | SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions | 2024 | CIDR | 4.5581306e-05 |
| 7,411 | ItemSuggest: A Data Management Platform for Machine Learned Ranking Services | 2019 | CIDR | 4.7364436e-05 |
| 11,888 | Synthesizing Data Programs | 2015 | CIDR | 4.1945683e-05 |
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |
| 6,796 | InferDB: In-Database Machine Learning Inference Using Indexes | 2024 | VLDB | 4.9241624e-05 |
| 7,138 | Ease.ml/ci and Ease.ml/meter in Action: Towards Data Management for Statistical Generalization | 2019 | VLDB | 4.8216981e-05 |
| 2,915 | Brainwash: A Data System for Feature Engineering | 2013 | CIDR | 7.9078385e-05 |
| 5,567 | Optimizing Data Pipelines for Machine Learning in Feature Stores | 2023 | VLDB | 5.4305348e-05 |
| 13,360 | Faster Evaluation of Labor-Intensive Features | 2015 | CIDR | - |