Automatically Generating Data Exploration Sessions Using Deep Reinforcement Learning
Summary: ATENA uses deep reinforcement learning to auto-generate full EDA notebooks from a dataset. By framing exploration as a control problem and employing a novel DRL architecture with a restricted operation set, it yields usable, insight-revealing sessions. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ori Bar El
- 2. Tova Milo
- 3. Amit Somech
Incoming Citations (Sorted by Pagerank)
Showing 16 of 16 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 460 | SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics | 2015 | VLDB | 0.00022516069 |
| 991 | Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System | 2017 | VLDB | 0.00014807273 |
| 1,350 | Northstar: An Interactive Data Science System | 2018 | VLDB | 0.00012431059 |
| 2,104 | Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets | 2016 | SIGMOD | 9.536298e-05 |
| 2,734 | Controlling False Discoveries During Interactive Data Exploration | 2017 | SIGMOD | 8.2078306e-05 |
| 4,426 | Data Debugging and Exploration with Vizier | 2019 | SIGMOD | 6.1969994e-05 |
| 4,758 | Optimization for Active Learning-based Interactive Database Exploration | 2019 | VLDB | 5.9422515e-05 |
| 9,830 | Towards Autonomous, Hands-Free Data Exploration | 2020 | CIDR | 4.2751057e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,392 | Automated Relational Data Explanation using External Semantic Knowledge | 2022 | VLDB | 4.1945683e-05 |
| 5,381 | Selective Data Acquisition in the Wild for Model Charging | 2022 | VLDB | 5.5399508e-05 |
| 5,472 | Guided Exploration of User Groups | 2020 | VLDB | 5.4888146e-05 |
| 5,383 | Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search | 2021 | VLDB | 5.5393038e-05 |
| 7,364 | ExplainED: Explanations for EDA Notebooks | 2020 | VLDB | 4.7519211e-05 |
| 7,222 | Guided Exploration of Data Summaries | 2022 | VLDB | 4.797186e-05 |
| 9,830 | Towards Autonomous, Hands-Free Data Exploration | 2020 | CIDR | 4.2751057e-05 |
| 9,219 | Intelligent Agents for Data Exploration | 2024 | VLDB | 4.3702863e-05 |
| 5,963 | Automatic Data Acquisition for Deep Learning | 2021 | VLDB | 5.2526794e-05 |
| 4,540 | Automating Exploratory Data Analysis via Machine Learning: An Overview | 2020 | SIGMOD | 6.1033443e-05 |