Database Paper Browser

Back to papers

Towards Scalable Visual Data Wrangling via Direct Manipulation

Summary: Reframes cleaning as direct manipulation of coordinated views, letting users repair anomalies interactively and generate reproducible scripts. Scales via user-defined detectors/wranglers, provenance, differential storage and Hopara integration. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
586
Venue
CIDR
Year
2026
Pagerank
4.1945683e-05
Overall Rank
9,984 | 30.55%
DOI
-

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
112 Potter's Wheel: An Interactive Data Cleaning System 2001 VLDB 0.00047045036
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
214 Scorpion: Explaining Away Outliers in Aggregate Queries 2013 VLDB 0.0003363692
1,337 HoloDetect: Few-Shot Learning for Error Detection 2019 SIGMOD 0.00012497164
1,612 Detecting Data Errors: Where are we and what needs to be done? 2016 VLDB 0.00011142794
1,894 Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning 2020 VLDB 0.0001018378
2,733 The Case for Data Visualization Management Systems [Vision Paper] 2014 VLDB 8.2078862e-05
3,393 Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows 2022 VLDB 7.1483239e-05
3,396 Automatic Data Repair: Are We Ready to Deploy? 2024 VLDB 7.1455126e-05
4,554 A Demonstration of AutoOD: A Self-Tuning Anomaly Detection System 2022 VLDB 6.0911296e-05
5,153 Horizon: Scalable Dependency-driven Data Cleaning 2021 VLDB 5.6607963e-05
5,684 Dagger: A Data (not code) Debugger 2020 CIDR 5.3720749e-05
7,052 Pre-trained Embeddings for Entity Resolution: An Experimental Analysis 2023 VLDB 4.8497453e-05
7,449 OTClean: Data Cleaning for Conditional Independence Violations using Optimal Transport 2024 SIGMOD 4.7269357e-05
9,475 OIE: An Interpretable System for Outlier Explanation and Summarization 2025 SIGMOD 4.3341665e-05
9,479 Data Imputation with Limited Data Redundancy Using Data Lakes 2025 VLDB 4.3341665e-05
9,500 Arachnid: Generalized Visual Data Cleaning 2019 SIGMOD 4.3341665e-05
10,828 Buckaroo: A Direct Manipulation Visual Data Wrangler 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Semantically Similar Papers