Database Paper Browser

Back to authors

Ju Fan

Author ID
721
ORCID
0000-0003-4729-9903
Links
(found by gpt-5.4 on apr 12 2026)
Most Frequent Institution
Renmin University of China
Pagerank
0.29624962
Overall Rank
158 | 99.26%
Paper Count
40

Affiliation Timeline

Incoming Non-self Citations Over Time

Total yearly non-self incoming citations across all papers by this author.

Publications by Paper Pagerank

Showing 40 of 40 publications.

Rank Title Year Venue Pagerank
998 CodeS: Towards Building Open-source Language Models for Text-to-SQL 2024 SIGMOD 0.00014729379
1,541 Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes 2023 CIDR 0.00011456579
2,349 RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation 2021 VLDB 8.9876423e-05
2,490 Online Topic-Aware Influence Maximization 2015 VLDB 8.6584707e-05
2,945 Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning 2023 SIGMOD 7.8377395e-05
3,318 Trajectory Simplification: An Experimental Study and Quality Analysis 2018 VLDB 7.2282052e-05
3,322 iCrowd: An Adaptive Crowdsourcing Framework 2015 SIGMOD 7.2230626e-05
4,102 GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data 2023 SIGMOD 6.4522929e-05
4,212 Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration 2023 SIGMOD 6.3555142e-05
4,884 Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration 2020 VLDB 5.8540287e-05
4,908 Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL 2024 VLDB 5.8339245e-05
5,028 Adaptive Data Augmentation for Supervised Learning over Missing Data 2021 VLDB 5.7506746e-05
5,232 SEAL: Spatio-Textual Similarity Search 2012 VLDB 5.6136151e-05
5,279 CDB: A Crowd-Powered Database System 2018 VLDB 5.5902418e-05
5,359 Discovering Your Selling Points: Personalized Social Influential Tags Exploration 2017 SIGMOD 5.5485493e-05
5,830 GEMINI: An Integrative Healthcare Analytics System 2014 VLDB 5.3113542e-05
5,958 Fine-grained Concept Linking using Neural Networks in Healthcare 2018 SIGMOD 5.2563968e-05
6,569 Domain Adaptation for Deep Entity Resolution 2022 SIGMOD 5.0065379e-05
6,765 Automatic Database Configuration Debugging using Retrieval-Augmented Language Models 2025 SIGMOD 4.9325583e-05
6,855 DBease: Making Databases User-friendly and Easily Accessible 2011 CIDR 4.9062505e-05
6,868 Cost-Effective Data Annotation using Game-Based Crowdsourcing 2019 VLDB 4.9010083e-05
7,117 Crowdsourced Data Management: Overview and Challenges 2017 SIGMOD 4.826509e-05
8,343 CrowdGame: A Game-Based Crowdsourcing System for Cost-Effective Data Labeling 2019 SIGMOD 4.5429217e-05
8,406 DADER: Hands-Off Entity Resolution with Domain Adaptation 2022 VLDB 4.5220083e-05
8,523 Controllable Tabular Data Synthesis Using Diffusion Models 2024 SIGMOD 4.4937074e-05
8,828 HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation 2023 SIGMOD 4.4407488e-05
9,077 VerifAI: Verified Generative AI 2024 CIDR 4.4010762e-05
9,371 Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations 2024 SIGMOD 4.3480692e-05
9,479 Data Imputation with Limited Data Redundancy Using Data Lakes 2025 VLDB 4.3341665e-05
10,249 TACO: A Benchmark for Open-Domain Text-to-SQL with Ambiguous and Cross-Database Queries 2026 VLDB 4.1945683e-05
10,424 Andromeda: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models 2025 SIGMOD 4.1945683e-05
10,610 Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation 2025 VLDB 4.1945683e-05
10,682 AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework 2025 VLDB 4.1945683e-05
10,707 PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking 2025 VLDB 4.1945683e-05
10,837 Natural Language to SQL: State of the Art and Open Problems 2025 VLDB 4.1945683e-05
11,000 MisDetect: Iterative Mislabel Detection using Early Loss 2024 VLDB 4.1945683e-05
11,026 Improving Graph Compression for Efficient Resource-Constrained Graph Analytics 2024 VLDB 4.1945683e-05
11,347 OpenTFV: An Open Domain Table-Based Fact Verification System 2022 SIGMOD 4.1945683e-05
11,788 CDB: Optimizing Queries with Crowd-Based Selections and Joins 2017 SIGMOD 4.1945683e-05
12,038 TsingNUS: A Location-Based Service System Towards Live City 2013 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Frequent Co-authors

Co-authored at least 5 papers.

Co-author Shared Papers Rank Pagerank
Guoliang Li 22 8 0.98178505
Nan Tang 19 47 0.55638652
Xiaoyong Du 17 115 0.35857203
Chengliang Chai 9 211 0.24025524
Lei Cao 6 146 0.30898998
Yuyu Luo 6 427 0.13303167
Samuel R. Madden 5 1 1.3916842
Tongyu Liu 5 2,130 0.035678835