Back to authors
Nan Tang
- Author ID
- 1260
- ORCID
-
0000-0003-2832-0295
- Links
-
(found by gpt-5.2 on feb 8th, 2026)
- Most Frequent Institution
- Qatar Computing Research Institute
- Pagerank
- 0.55638652
- Overall Rank
- 47 | 99.78%
- Paper Count
- 72
Affiliation Timeline
Incoming Non-self Citations Over Time
Total yearly non-self incoming citations across all papers by this author.
Publications by Paper Pagerank
Showing 22 of 72 publications.
| Rank |
Title |
Year |
Venue |
Pagerank |
| 8,268 |
Learned Data-aware Image Representations of Line Charts for Similarity Search |
2023 |
SIGMOD |
4.5456668e-05 |
| 8,385 |
Are Large Language Models a Good Replacement of Taxonomies? |
2024 |
VLDB |
4.5303205e-05 |
| 8,406 |
DADER: Hands-Off Entity Resolution with Domain Adaptation |
2022 |
VLDB |
4.5220083e-05 |
| 8,523 |
Controllable Tabular Data Synthesis Using Diffusion Models |
2024 |
SIGMOD |
4.4937074e-05 |
| 8,828 |
HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation |
2023 |
SIGMOD |
4.4407488e-05 |
| 8,875 |
CerFix: A System for Cleaning Data with Certain Fixes |
2011 |
VLDB |
4.430475e-05 |
| 9,077 |
VerifAI: Verified Generative AI |
2024 |
CIDR |
4.4010762e-05 |
| 9,221 |
VisClean: Interactive Cleaning for Progressive Visualization |
2020 |
VLDB |
4.3699444e-05 |
| 9,278 |
Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples |
2016 |
SIGMOD |
4.3639892e-05 |
| 9,306 |
Debugging Large-Scale Data Science Pipelines using Dagger |
2020 |
VLDB |
4.3572942e-05 |
| 9,479 |
Data Imputation with Limited Data Redundancy Using Data Lakes |
2025 |
VLDB |
4.3341665e-05 |
| 9,577 |
CoClean: Collaborative Data Cleaning |
2020 |
SIGMOD |
4.3248438e-05 |
| 9,810 |
Rheem: Enabling Multi-Platform Task Execution |
2016 |
SIGMOD |
4.278405e-05 |
| 10,289 |
LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning |
2026 |
VLDB |
4.1945683e-05 |
| 10,424 |
Andromeda: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,610 |
Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation |
2025 |
VLDB |
4.1945683e-05 |
| 10,682 |
AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework |
2025 |
VLDB |
4.1945683e-05 |
| 10,837 |
Natural Language to SQL: State of the Art and Open Problems |
2025 |
VLDB |
4.1945683e-05 |
| 11,000 |
MisDetect: Iterative Mislabel Detection using Early Loss |
2024 |
VLDB |
4.1945683e-05 |
| 11,582 |
Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries |
2020 |
SIGMOD |
4.1945683e-05 |
| 13,283 |
DeepTrack: Monitoring and Exploring Spatio-Temporal Data – A Case of Tracking COVID-19 – |
2020 |
VLDB |
- |
| 13,340 |
Errata for "Lightning Fast and Space Efficient Inequality Joins" (PVLDB 8(13): 2074-2085) |
2017 |
VLDB |
- |
Frequent Co-authors
Co-authored at least 5 papers.