| 998 | CodeS: Towards Building Open-source Language Models for Text-to-SQL | 2024 | SIGMOD | 0.00014729379 |
| 1,541 | Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes | 2023 | CIDR | 0.00011456579 |
| 2,349 | RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation | 2021 | VLDB | 8.9876423e-05 |
| 2,490 | Online Topic-Aware Influence Maximization | 2015 | VLDB | 8.6584707e-05 |
| 2,945 | Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning | 2023 | SIGMOD | 7.8377395e-05 |
| 3,318 | Trajectory Simplification: An Experimental Study and Quality Analysis | 2018 | VLDB | 7.2282052e-05 |
| 3,322 | iCrowd: An Adaptive Crowdsourcing Framework | 2015 | SIGMOD | 7.2230626e-05 |
| 4,102 | GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data | 2023 | SIGMOD | 6.4522929e-05 |
| 4,212 | Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration | 2023 | SIGMOD | 6.3555142e-05 |
| 4,884 | Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration | 2020 | VLDB | 5.8540287e-05 |
| 4,908 | Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL | 2024 | VLDB | 5.8339245e-05 |
| 5,028 | Adaptive Data Augmentation for Supervised Learning over Missing Data | 2021 | VLDB | 5.7506746e-05 |
| 5,232 | SEAL: Spatio-Textual Similarity Search | 2012 | VLDB | 5.6136151e-05 |
| 5,279 | CDB: A Crowd-Powered Database System | 2018 | VLDB | 5.5902418e-05 |
| 5,359 | Discovering Your Selling Points: Personalized Social Influential Tags Exploration | 2017 | SIGMOD | 5.5485493e-05 |
| 5,830 | GEMINI: An Integrative Healthcare Analytics System | 2014 | VLDB | 5.3113542e-05 |
| 5,958 | Fine-grained Concept Linking using Neural Networks in Healthcare | 2018 | SIGMOD | 5.2563968e-05 |
| 6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05 |
| 6,765 | Automatic Database Configuration Debugging using Retrieval-Augmented Language Models | 2025 | SIGMOD | 4.9325583e-05 |
| 6,855 | DBease: Making Databases User-friendly and Easily Accessible | 2011 | CIDR | 4.9062505e-05 |
| 6,868 | Cost-Effective Data Annotation using Game-Based Crowdsourcing | 2019 | VLDB | 4.9010083e-05 |
| 7,117 | Crowdsourced Data Management: Overview and Challenges | 2017 | SIGMOD | 4.826509e-05 |
| 8,343 | CrowdGame: A Game-Based Crowdsourcing System for Cost-Effective Data Labeling | 2019 | SIGMOD | 4.5429217e-05 |
| 8,406 | DADER: Hands-Off Entity Resolution with Domain Adaptation | 2022 | VLDB | 4.5220083e-05 |
| 8,523 | Controllable Tabular Data Synthesis Using Diffusion Models | 2024 | SIGMOD | 4.4937074e-05 |
| 8,828 | HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation | 2023 | SIGMOD | 4.4407488e-05 |
| 9,077 | VerifAI: Verified Generative AI | 2024 | CIDR | 4.4010762e-05 |
| 9,371 | Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations | 2024 | SIGMOD | 4.3480692e-05 |
| 9,479 | Data Imputation with Limited Data Redundancy Using Data Lakes | 2025 | VLDB | 4.3341665e-05 |
| 10,249 | TACO: A Benchmark for Open-Domain Text-to-SQL with Ambiguous and Cross-Database Queries | 2026 | VLDB | 4.1945683e-05 |
| 10,424 | Andromeda: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models | 2025 | SIGMOD | 4.1945683e-05 |
| 10,610 | Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation | 2025 | VLDB | 4.1945683e-05 |
| 10,682 | AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework | 2025 | VLDB | 4.1945683e-05 |
| 10,707 | PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking | 2025 | VLDB | 4.1945683e-05 |
| 10,837 | Natural Language to SQL: State of the Art and Open Problems | 2025 | VLDB | 4.1945683e-05 |
| 11,000 | MisDetect: Iterative Mislabel Detection using Early Loss | 2024 | VLDB | 4.1945683e-05 |
| 11,026 | Improving Graph Compression for Efficient Resource-Constrained Graph Analytics | 2024 | VLDB | 4.1945683e-05 |
| 11,347 | OpenTFV: An Open Domain Table-Based Fact Verification System | 2022 | SIGMOD | 4.1945683e-05 |
| 11,788 | CDB: Optimizing Queries with Crowd-Based Selections and Joins | 2017 | SIGMOD | 4.1945683e-05 |
| 12,038 | TsingNUS: A Location-Based Service System Towards Live City | 2013 | SIGMOD | 4.1945683e-05 |