Smile: A System to Support Machine Learning on EEG Data at Scale
Summary: Smile is an end-to-end, scalable system for EEG interictal-ictal continuum pattern classification, fusing visualization-based labeling of 350M segments (30 TB) with a deep-learning active-learning loop. It delivers sub-second labeling latency and model-guided sample selection to clinicians, enabling rapid clinician–model convergence. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Lei Cao
2. Wenbo Tao
3. Sungtae An
4. Jing Jin
5. Yizhou Yan
6. Xiaoyu Liu
7. Wendong Ge
8. Adam Sah
9. Leilani Battle
10. Jimeng Sun
11. Remco Chang
12. Brandon Westover
13. Samuel Madden
14. Michael Stonebraker
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,684 | Dagger: A Data (not code) Debugger | 2020 | CIDR | 5.3720749e-05 |
| 6,448 | Sintel: A Machine Learning Framework to Extract Insights from Signals | 2022 | SIGMOD | 5.0587973e-05 |
| 8,714 | LANCET: Labeling Complex Data at Scale | 2021 | VLDB | 4.4619818e-05 |
| 9,247 | iEDeaL: A Deep Learning Framework for Detecting Highly Imbalanced Interictal Epileptiform Discharges | 2023 | VLDB | 4.3690661e-05 |
| 10,952 | RITA: Group Attention is All You Need for Timeseries Analytics | 2024 | SIGMOD | 4.1945683e-05 |
| 11,594 | TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications | 2020 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 460 | SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics | 2015 | VLDB | 0.00022516069 |
| 1,215 | Snuba: Automating Weak Supervision to Label Training Data | 2019 | VLDB | 0.0001323375 |
| 1,587 | Dynamic Prefetching of Data Tiles for Interactive Visualization | 2016 | SIGMOD | 0.00011245116 |
| 2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05 |
| 2,981 | Efficient Spatial Sampling of Large Geographical Tables | 2012 | SIGMOD | 7.7809306e-05 |
| 4,681 | Adaptive Sampling for Rapidly Matching Histograms | 2018 | VLDB | 6.0034918e-05 |
| 5,370 | Kyrix: Interactive Visual Data Exploration at Scale | 2019 | CIDR | 5.5432976e-05 |