| 4,353 |
Overlap Set Similarity Joins with Theoretical Guarantees
|
2018 |
SIGMOD |
6.263585e-05 |
| 4,414 |
Efficient Type-Ahead Search on Relational Data: a TASTIER Approach
|
2009 |
SIGMOD |
6.2056993e-05 |
| 4,437 |
Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics
|
2015 |
VLDB |
6.1907793e-05 |
| 4,514 |
An Empirical Evaluation of Columnar Storage Formats
|
2024 |
VLDB |
6.1204636e-05 |
| 4,531 |
Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights
|
2018 |
VLDB |
6.1073703e-05 |
| 4,543 |
FACE: A Normalizing Flow based Cardinality Estimator
|
2022 |
VLDB |
6.1011198e-05 |
| 4,579 |
Crowdsourced Top-k Algorithms: An Experimental Evaluation
|
2016 |
VLDB |
6.070469e-05 |
| 4,635 |
Mining Precision Interfaces From Query Logs
|
2019 |
SIGMOD |
6.033398e-05 |
| 4,646 |
CARMI: A Cache-Aware Learned Index with a Cost-based Construction Algorithm
|
2022 |
VLDB |
6.0250374e-05 |
| 4,825 |
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks
|
2021 |
SIGMOD |
5.8946721e-05 |
| 4,835 |
Proteus: A Self-Designing Range Filter
|
2022 |
SIGMOD |
5.8905445e-05 |
| 4,854 |
TOAIN: A Throughput Optimizing Adaptive Index for Answering Dynamic kNN Queries on Road Networks
|
2018 |
VLDB |
5.8743687e-05 |
| 4,879 |
Approximately Counting Triangles in Large Graph Streams Including Edge Duplicates with a Fixed Memory Usage
|
2018 |
VLDB |
5.8575676e-05 |
| 4,884 |
Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration
|
2020 |
VLDB |
5.8540287e-05 |
| 4,908 |
Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL
|
2024 |
VLDB |
5.8339245e-05 |
| 4,911 |
Unsupervised Contextual Anomaly Detection for Database Systems
|
2022 |
SIGMOD |
5.8328593e-05 |
| 4,976 |
Efficient Top-K SimRank-based Similarity Join
|
2015 |
VLDB |
5.7882361e-05 |
| 5,002 |
Sequential Data Cleaning: A Statistical Approach
|
2016 |
SIGMOD |
5.7671075e-05 |
| 5,028 |
Adaptive Data Augmentation for Supervised Learning over Missing Data
|
2021 |
VLDB |
5.7506746e-05 |
| 5,071 |
Time Series Data Encoding for Efficient Storage: A Comparative Analysis in Apache IoTDB
|
2022 |
VLDB |
5.7188461e-05 |
| 5,073 |
Faerie: Efficient Filtering Algorithms for Approximate Dictionary-based Entity Extraction
|
2011 |
SIGMOD |
5.7177424e-05 |
| 5,074 |
Learned Index: A Comprehensive Experimental Evaluation
|
2023 |
VLDB |
5.7175726e-05 |
| 5,078 |
Efficient Location-Aware Influence Maximization
|
2014 |
SIGMOD |
5.715243e-05 |
| 5,084 |
In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle
|
2022 |
SIGMOD |
5.7091191e-05 |
| 5,110 |
LightNE: A Lightweight Graph Processing System for Network Embedding
|
2021 |
SIGMOD |
5.6901951e-05 |
| 5,232 |
SEAL: Spatio-Textual Similarity Search
|
2012 |
VLDB |
5.6136151e-05 |
| 5,253 |
Enriching Data Imputation with Extensive Similarity Neighbors
|
2015 |
VLDB |
5.6014916e-05 |
| 5,255 |
Efficient k-Regret Query Algorithm with Restriction-free Bound for any Dimensionality
|
2018 |
SIGMOD |
5.6013035e-05 |
| 5,279 |
CDB: A Crowd-Powered Database System
|
2018 |
VLDB |
5.5902418e-05 |
| 5,283 |
Optimistic Concurrency Control by Melding Trees
|
2011 |
VLDB |
5.5856276e-05 |
| 5,321 |
FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training
|
2024 |
VLDB |
5.5710441e-05 |
| 5,362 |
Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach
|
2016 |
SIGMOD |
5.5473503e-05 |
| 5,371 |
LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning
|
2022 |
SIGMOD |
5.5428776e-05 |
| 5,381 |
Selective Data Acquisition in the Wild for Model Charging
|
2022 |
VLDB |
5.5399508e-05 |
| 5,469 |
Learned Cardinality Estimation for Similarity Queries
|
2021 |
SIGMOD |
5.4898192e-05 |
| 5,484 |
DeepEye: Creating Good Data Visualizations by Keyword Search
|
2018 |
SIGMOD |
5.4826544e-05 |
| 5,629 |
DAMR: Dynamic Adjacency Matrix Representation Learning for Multivariate Time Series Imputation
|
2023 |
SIGMOD |
5.4025905e-05 |
| 5,672 |
Effective Keyword-based Selection of Relational Databases
|
2007 |
SIGMOD |
5.3784128e-05 |
| 5,734 |
Efficient Algorithms for Crowd-Aided Categorization
|
2020 |
VLDB |
5.3482904e-05 |
| 5,777 |
ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection
|
2024 |
VLDB |
5.3308813e-05 |
| 5,852 |
Repairing Vertex Labels under Neighborhood Constraints
|
2014 |
VLDB |
5.3007132e-05 |
| 5,861 |
Machine Learning for Databases
|
2021 |
VLDB |
5.298883e-05 |
| 5,863 |
GRF: A Global Range Filter for LSM-Trees with Shape Encoding
|
2024 |
SIGMOD |
5.2979639e-05 |
| 5,867 |
Combining Design and Performance in a Data Visualization Management System
|
2017 |
CIDR |
5.296418e-05 |
| 5,923 |
HyBench: A New Benchmark for HTAP Databases
|
2024 |
VLDB |
5.2721765e-05 |
| 5,963 |
Automatic Data Acquisition for Deep Learning
|
2021 |
VLDB |
5.2526794e-05 |
| 6,005 |
Evaluating Ridesharing Algorithms using the Jargo Real-Time Stochastic Simulator
|
2020 |
VLDB |
5.2415551e-05 |
| 6,014 |
WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation
|
2016 |
VLDB |
5.2415551e-05 |
| 6,146 |
Distributed Graph Simulation: Impossibility and Possibility
|
2014 |
VLDB |
5.1857597e-05 |
| 6,229 |
When Tree Meets Hash: Reducing Random Reads for Index Structures on Persistent Memories
|
2023 |
SIGMOD |
5.1463389e-05 |