| 4,682 |
Scalable Discovery of Unique Column Combinations
|
2014 |
VLDB |
6.0022412e-05 |
| 4,695 |
DataXFormer: An Interactive Data Transformation Tool
|
2015 |
SIGMOD |
5.9927993e-05 |
| 4,784 |
Divide & Conquer-based Inclusion Dependency Discovery
|
2015 |
VLDB |
5.9240851e-05 |
| 4,825 |
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks
|
2021 |
SIGMOD |
5.8946721e-05 |
| 4,904 |
Temporal Rules Discovery for Web Data Cleaning
|
2016 |
VLDB |
5.8399195e-05 |
| 5,028 |
Adaptive Data Augmentation for Supervised Learning over Missing Data
|
2021 |
VLDB |
5.7506746e-05 |
| 5,153 |
Horizon: Scalable Dependency-driven Data Cleaning
|
2021 |
VLDB |
5.6607963e-05 |
| 5,192 |
Pattern Functional Dependencies for Data Cleaning
|
2020 |
VLDB |
5.6375087e-05 |
| 5,205 |
ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies
|
2019 |
SIGMOD |
5.630869e-05 |
| 5,381 |
Selective Data Acquisition in the Wild for Model Charging
|
2022 |
VLDB |
5.5399508e-05 |
| 5,382 |
That's All Folks! LLUNATIC Goes Open Source
|
2014 |
VLDB |
5.5397633e-05 |
| 5,462 |
RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes
|
2024 |
VLDB |
5.494769e-05 |
| 5,469 |
Learned Cardinality Estimation for Similarity Queries
|
2021 |
SIGMOD |
5.4898192e-05 |
| 5,484 |
DeepEye: Creating Good Data Visualizations by Keyword Search
|
2018 |
SIGMOD |
5.4826544e-05 |
| 5,660 |
Descriptive and Prescriptive Data Cleaning
|
2014 |
SIGMOD |
5.3847321e-05 |
| 5,684 |
Dagger: A Data (not code) Debugger
|
2020 |
CIDR |
5.3720749e-05 |
| 5,729 |
KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing
|
2015 |
VLDB |
5.3506368e-05 |
| 5,790 |
AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data
|
2015 |
VLDB |
5.3269734e-05 |
| 5,937 |
DataXFormer: Leveraging the Web for Semantic Transformations
|
2015 |
CIDR |
5.2650964e-05 |
| 5,963 |
Automatic Data Acquisition for Deep Learning
|
2021 |
VLDB |
5.2526794e-05 |
| 6,280 |
Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks
|
2023 |
VLDB |
5.1290457e-05 |
| 6,330 |
Efficient Construction of Approximate Ad-Hoc ML models Through Materialization and Reuse
|
2018 |
VLDB |
5.1077416e-05 |
| 6,350 |
NADEEF: A Generalized Data Cleaning System
|
2013 |
VLDB |
5.101815e-05 |
| 6,410 |
Publishing Attributed Social Graphs with Formal Privacy Guarantees
|
2016 |
SIGMOD |
5.0753667e-05 |
| 6,569 |
Domain Adaptation for Deep Entity Resolution
|
2022 |
SIGMOD |
5.0065379e-05 |
| 6,744 |
MapRat: Meaningful Explanation, Interactive Exploration and Geo-Visualization of Collaborative Ratings
|
2012 |
VLDB |
4.9419773e-05 |
| 6,801 |
Updating Graph Indices with a One-Pass Algorithm
|
2015 |
SIGMOD |
4.9226813e-05 |
| 6,842 |
Towards Democratizing Relational Data Visualization
|
2019 |
SIGMOD |
4.9103931e-05 |
| 6,854 |
Sapphire: Querying RDF Data Made Simple
|
2016 |
VLDB |
4.9066129e-05 |
| 6,865 |
Indexed Fast Network Proximity Querying
|
2018 |
VLDB |
4.9041884e-05 |
| 6,986 |
A Cost-based Optimizer for Gradient Descent Optimization
|
2017 |
SIGMOD |
4.8727048e-05 |
| 7,179 |
Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning
|
2023 |
VLDB |
4.8078895e-05 |
| 7,643 |
Cross Modal Data Discovery over Structured and Unstructured Data Lakes
|
2023 |
VLDB |
4.6901105e-05 |
| 7,958 |
CARTILAGE: Adding Flexibility to the Hadoop Skeleton
|
2013 |
SIGMOD |
4.613363e-05 |
| 8,000 |
Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics
|
2019 |
VLDB |
4.6092803e-05 |
| 8,006 |
ALEX: Automatic Link Exploration in Linked Data
|
2015 |
SIGMOD |
4.6080343e-05 |
| 8,268 |
Learned Data-aware Image Representations of Line Charts for Similarity Search
|
2023 |
SIGMOD |
4.5456668e-05 |
| 8,294 |
QARTA: An ML-based System for Accurate Map Services
|
2021 |
VLDB |
4.5435639e-05 |
| 8,300 |
sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms
|
2015 |
SIGMOD |
4.5435639e-05 |
| 8,366 |
WWHow! Freeing Data Storage from Cages
|
2013 |
CIDR |
4.5357016e-05 |
| 8,406 |
DADER: Hands-Off Entity Resolution with Domain Adaptation
|
2022 |
VLDB |
4.5220083e-05 |
| 8,629 |
Spatial Queries with Two kNN Predicates
|
2012 |
VLDB |
4.4809879e-05 |
| 8,653 |
ApproxML: Efficient Approximate Ad-Hoc ML Models Through Materialization and Reuse
|
2019 |
VLDB |
4.475291e-05 |
| 8,789 |
Machine Learning Meets Big Spatial Data
|
2019 |
VLDB |
4.4509194e-05 |
| 8,793 |
PAQO: A Preference-Aware Query Optimizer for PostgreSQL
|
2013 |
VLDB |
4.4505944e-05 |
| 8,828 |
HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation
|
2023 |
SIGMOD |
4.4407488e-05 |
| 8,921 |
Leveraging Similarity Joins for Signal Reconstruction
|
2018 |
VLDB |
4.427232e-05 |
| 9,100 |
Who Tags What? An Analysis Framework
|
2012 |
VLDB |
4.3965818e-05 |
| 9,137 |
Combating Fake News: A Data Management and Mining Perspective
|
2019 |
VLDB |
4.3881065e-05 |
| 9,176 |
RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets
|
2016 |
SIGMOD |
4.383548e-05 |