Nan Tang

Author ID: 1260
ORCID: 0000-0003-2832-0295
Links: (found by gpt-5.2 on feb 8th, 2026)
Most Frequent Institution: Qatar Computing Research Institute
Pagerank: 0.55638652
Overall Rank: 47 | 99.78%
Paper Count: 72

Affiliation Timeline

Qatar Computing Research Institute Most frequent 2013 - 2023 | 50 papers
Hamad Bin Khalifa University 2016 - 2023 | 28 papers
Hong Kong University of Science and Technology 2023 - 2026 | 22 papers
University of Edinburgh 2010 - 2011 | 4 papers

Incoming Non-self Citations Over Time

Total yearly non-self incoming citations across all papers by this author.

Publications by Paper Pagerank

Showing 22 of 72 publications.

Rank	Title	Year	Venue	Pagerank
8,268	Learned Data-aware Image Representations of Line Charts for Similarity Search	2023	SIGMOD	4.5456668e-05
8,385	Are Large Language Models a Good Replacement of Taxonomies?	2024	VLDB	4.5303205e-05
8,406	DADER: Hands-Off Entity Resolution with Domain Adaptation	2022	VLDB	4.5220083e-05
8,523	Controllable Tabular Data Synthesis Using Diffusion Models	2024	SIGMOD	4.4937074e-05
8,828	HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation	2023	SIGMOD	4.4407488e-05
8,875	CerFix: A System for Cleaning Data with Certain Fixes	2011	VLDB	4.430475e-05
9,077	VerifAI: Verified Generative AI	2024	CIDR	4.4010762e-05
9,221	VisClean: Interactive Cleaning for Progressive Visualization	2020	VLDB	4.3699444e-05
9,278	Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples	2016	SIGMOD	4.3639892e-05
9,306	Debugging Large-Scale Data Science Pipelines using Dagger	2020	VLDB	4.3572942e-05
9,479	Data Imputation with Limited Data Redundancy Using Data Lakes	2025	VLDB	4.3341665e-05
9,577	CoClean: Collaborative Data Cleaning	2020	SIGMOD	4.3248438e-05
9,810	Rheem: Enabling Multi-Platform Task Execution	2016	SIGMOD	4.278405e-05
10,289	LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning	2026	VLDB	4.1945683e-05
10,424	Andromeda: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models	2025	SIGMOD	4.1945683e-05
10,610	Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation	2025	VLDB	4.1945683e-05
10,682	AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework	2025	VLDB	4.1945683e-05
10,837	Natural Language to SQL: State of the Art and Open Problems	2025	VLDB	4.1945683e-05
11,000	MisDetect: Iterative Mislabel Detection using Early Loss	2024	VLDB	4.1945683e-05
11,582	Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries	2020	SIGMOD	4.1945683e-05
13,283	DeepTrack: Monitoring and Exploring Spatio-Temporal Data – A Case of Tracking COVID-19 –	2020	VLDB	-
13,340	Errata for "Lightning Fast and Space Efficient Inequality Joins" (PVLDB 8(13): 2074-2085)	2017	VLDB	-

Frequent Co-authors

Co-authored at least 5 papers.

Co-author	Shared Papers	Rank	Pagerank
Guoliang Li	26	8	0.98178505
Mourad Ouzzani	25	123	0.3399196
Ju Fan	19	158	0.29624962
Chengliang Chai	18	211	0.24025524
Yuyu Luo	16	427	0.13303167
Samuel R. Madden	13	1	1.3916842
Xiaoyong Du	11	115	0.35857203
Paolo Papotti	11	139	0.31420438
Lei Cao	10	146	0.30898998
Ahmed K. Elmagarmid	10	208	0.24300038
Jorge-Arnulfo Quiané-Ruiz	10	221	0.2307558
Michael Stonebraker	9	6	1.0621118
Ihab F. Ilyas	9	64	0.48430143