Database Paper Browser

Back to authors

Nan Tang

Author ID
1260
ORCID
0000-0003-2832-0295
Links
(found by gpt-5.2 on feb 8th, 2026)
Most Frequent Institution
Qatar Computing Research Institute
Pagerank
0.55638652
Overall Rank
47 | 99.78%
Paper Count
72

Affiliation Timeline

Incoming Non-self Citations Over Time

Total yearly non-self incoming citations across all papers by this author.

Publications by Paper Pagerank

Showing 50 of 72 publications.

Rank Title Year Venue Pagerank
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
1,012 NADEEF: A Commodity Data Cleaning System 2013 SIGMOD 0.0001464733
1,159 Towards Certain Fixes with Editing Rules and Master Data 2010 VLDB 0.00013592813
1,277 The Data Civilizer System 2017 CIDR 0.00012879695
1,414 Graph Pattern Matching: From Intractable to Polynomial Time 2010 VLDB 0.00012118275
1,541 Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes 2023 CIDR 0.00011456579
1,546 KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing 2015 SIGMOD 0.00011446851
1,612 Detecting Data Errors: Where are we and what needs to be done? 2016 VLDB 0.00011142794
1,831 Synthesizing Entity Matching Rules by Examples 2018 VLDB 0.00010384082
1,892 Querying Shortest Paths on Time Dependent Road Networks 2019 VLDB 0.00010185573
2,349 RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation 2021 VLDB 8.9876423e-05
2,607 Graph Stream Summarization: From Big Bang to Big Crunch 2016 SIGMOD 8.4630211e-05
2,823 Interaction between Record Matching and Data Repairing 2011 SIGMOD 8.0593894e-05
2,945 Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning 2023 SIGMOD 7.8377395e-05
2,946 BigDansing: A System for Big Data Cleansing 2015 SIGMOD 7.8372441e-05
2,968 Raha: A Configuration-Free Error Detection System 2019 SIGMOD 7.7985097e-05
3,192 Towards Dependable Data Repairing with Fixing Rules 2014 SIGMOD 7.4095761e-05
3,265 RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! - 2018 VLDB 7.3083672e-05
3,449 Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation 2022 VLDB 7.0824319e-05
3,571 Lightning Fast and Space Efficient Inequality Joins 2015 VLDB 6.9580858e-05
3,582 NADEEF/ER: Generic and Interactive Entity Resolution 2014 SIGMOD 6.9479263e-05
3,640 Deep Learning for Blocking in Entity Matching: A Design Space Exploration 2021 VLDB 6.8891671e-05
3,662 The Dawn of Natural Language to SQL: Are We Fully Ready? 2024 VLDB 6.8672143e-05
3,861 Generating Concise Entity Matching Rules 2017 SIGMOD 6.6878164e-05
3,970 HAIChart: Human and AI Paired Visualization System 2024 VLDB 6.5784767e-05
3,976 UGuide – User-Guided Discovery of FD-Detectable Errors 2017 SIGMOD 6.5736462e-05
4,102 GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data 2023 SIGMOD 6.4522929e-05
4,212 Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration 2023 SIGMOD 6.3555142e-05
4,825 Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks 2021 SIGMOD 5.8946721e-05
4,908 Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL 2024 VLDB 5.8339245e-05
5,028 Adaptive Data Augmentation for Supervised Learning over Missing Data 2021 VLDB 5.7506746e-05
5,058 A Demo of the Data Civilizer System 2017 SIGMOD 5.7280139e-05
5,192 Pattern Functional Dependencies for Data Cleaning 2020 VLDB 5.6375087e-05
5,205 ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies 2019 SIGMOD 5.630869e-05
5,381 Selective Data Acquisition in the Wild for Model Charging 2022 VLDB 5.5399508e-05
5,462 RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes 2024 VLDB 5.494769e-05
5,469 Learned Cardinality Estimation for Similarity Queries 2021 SIGMOD 5.4898192e-05
5,484 DeepEye: Creating Good Data Visualizations by Keyword Search 2018 SIGMOD 5.4826544e-05
5,684 Dagger: A Data (not code) Debugger 2020 CIDR 5.3720749e-05
5,729 KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing 2015 VLDB 5.3506368e-05
5,963 Automatic Data Acquisition for Deep Learning 2021 VLDB 5.2526794e-05
6,280 Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks 2023 VLDB 5.1290457e-05
6,350 NADEEF: A Generalized Data Cleaning System 2013 VLDB 5.101815e-05
6,569 Domain Adaptation for Deep Entity Resolution 2022 SIGMOD 5.0065379e-05
6,765 Automatic Database Configuration Debugging using Retrieval-Augmented Language Models 2025 SIGMOD 4.9325583e-05
6,842 Towards Democratizing Relational Data Visualization 2019 SIGMOD 4.9103931e-05
7,179 Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning 2023 VLDB 4.8078895e-05
7,582 LakeCompass: An End-to-End System for Data Maintenance, Search and Analysis in Data Lakes 2024 VLDB 4.7046388e-05
8,000 Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics 2019 VLDB 4.6092803e-05
8,116 LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes 2024 VLDB 4.581507e-05
Previous Page 1 / 2 Next

Frequent Co-authors

Co-authored at least 5 papers.

Co-author Shared Papers Rank Pagerank
Guoliang Li 26 8 0.98178505
Mourad Ouzzani 25 123 0.3399196
Ju Fan 19 158 0.29624962
Chengliang Chai 18 211 0.24025524
Yuyu Luo 16 427 0.13303167
Samuel R. Madden 13 1 1.3916842
Xiaoyong Du 11 115 0.35857203
Paolo Papotti 11 139 0.31420438
Lei Cao 10 146 0.30898998
Ahmed K. Elmagarmid 10 208 0.24300038
Jorge-Arnulfo Quiané-Ruiz 10 221 0.2307558
Michael Stonebraker 9 6 1.0621118
Ihab F. Ilyas 9 64 0.48430143