Database Paper Browser


Eugene Wu

Author ID: 154
ORCID: -
Links: (found by gpt-5.2 on Feb 09 2026)
Most Frequent Institution: Columbia University
Pagerank: 0.50192075
Overall Rank: 59 | 99.73%
Paper Count: 61

Affiliation Timeline

Incoming Non-self Citations Over Time

Total yearly non-self incoming citations across all papers by this author.
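
As a rough illustration of the metric described above, the following sketch tallies incoming citations per year while skipping self-citations. The record layout (`year`, `citing_authors`), the helper name `yearly_non_self_citations`, and the sample data are all hypothetical. The sketch also assumes that a self-citation is one whose citing paper lists the profiled author, and that citations are bucketed by the citing paper's year; the site may define either differently.

```python
from collections import Counter


def yearly_non_self_citations(citations, author):
    """Count incoming citations per year, skipping self-citations.

    Assumption: a citation is a self-citation when `author` appears among
    the authors of the citing paper. Each record is a dict with the
    hypothetical keys `year` (year of the citing paper) and `citing_authors`.
    """
    counts = Counter()
    for record in citations:
        if author in record["citing_authors"]:
            continue  # self-citation under the assumed definition
        counts[record["year"]] += 1
    # Return the totals ordered by year, ready for plotting.
    return dict(sorted(counts.items()))


# Hypothetical sample records, not taken from the site.
sample = [
    {"year": 2019, "citing_authors": ["A. Author", "B. Author"]},
    {"year": 2019, "citing_authors": ["Eugene Wu", "C. Author"]},
    {"year": 2020, "citing_authors": ["D. Author"]},
]

print(yearly_non_self_citations(sample, "Eugene Wu"))
```

Matching authors by name string is fragile in practice; a stable identifier such as the Author ID shown on this page would be a safer key.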

Publications by Paper Pagerank

Showing 50 of 61 publications.

Rank | Title | Year | Venue | Pagerank
107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684
214 | Scorpion: Explaining Away Outliers in Aggregate Queries | 2013 | VLDB | 0.0003363692
249 | Crowdsourced Databases: Query Processing with People | 2011 | CIDR | 0.00030740523
259 | High-Performance Complex Event Processing over Streams | 2006 | SIGMOD | 0.00030174924
267 | Human-powered Sorts and Joins | 2012 | VLDB | 0.00029690405
664 | Relational Cloud: A Database-as-a-Service for the Cloud | 2011 | CIDR | 0.00018465843
791 | ActiveClean: Interactive Data Cleaning For Statistical Modeling | 2016 | VLDB | 0.00016629664
911 | Design Considerations for High Fan-in Systems: The HiFi Approach | 2005 | CIDR | 0.00015419842
1,963 | DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | 2025 | VLDB | 9.929429e-05
2,123 | Demonstration of Qurk: A Query Processor for Human Operators | 2011 | SIGMOD | 9.4945521e-05
2,280 | SMOKE: Fine-grained Lineage at Interactive Speed | 2018 | VLDB | 9.1111033e-05
2,340 | SASE: Complex Event Processing over Streams | 2007 | CIDR | 9.004232e-05
2,709 | Vertexica: Your Relational Friend for Graph Analytics! | 2014 | VLDB | 8.2530203e-05
2,733 | The Case for Data Visualization Management Systems [Vision Paper] | 2014 | VLDB | 8.2078862e-05
2,753 | Complaint-driven Training Data Debugging for Query 2.0 | 2020 | SIGMOD | 8.1724339e-05
3,155 | Ten Years of WebTables | 2018 | VLDB | 7.4672742e-05
3,280 | The Case for RodentStore, an Adaptive, Declarative Storage System | 2009 | CIDR | 7.2828962e-05
3,347 | Collaborative Data Analytics with DataHub | 2015 | VLDB | 7.1921364e-05
3,508 | spade: Synthesizing Data Quality Assertions for Large Language Model Pipelines | 2024 | VLDB | 7.0271496e-05
3,737 | Skipping-oriented Partitioning for Columnar Layouts | 2017 | VLDB | 6.8033227e-05
4,451 | CLAMShell: Speeding up Crowds for Low-latency Data Labeling | 2016 | VLDB | 6.1738675e-05
4,635 | Mining Precision Interfaces From Query Logs | 2019 | SIGMOD | 6.033398e-05
4,877 | Precision Interfaces for Different Modalities | 2018 | SIGMOD | 5.8593569e-05
5,222 | Enabling SQL-based Training Data Debugging for Federated Learning | 2022 | VLDB | 5.6210545e-05
5,445 | QFix: Diagnosing Errors through Query Histories | 2017 | SIGMOD | 5.5020909e-05
5,560 | PI2: End-to-end Interactive Visualization Interface Generation from Queries | 2022 | SIGMOD | 5.4336252e-05
5,867 | Combining Design and Performance in a Data Visualization Management System | 2017 | CIDR | 5.296418e-05
5,929 | ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning | 2016 | SIGMOD | 5.2682177e-05
6,069 | OM3: An Ordered Multi-level Min-Max Representation for Interactive Progressive Visualization of Time Series | 2023 | SIGMOD | 5.2280784e-05
6,077 | The Fast and the Private: Task-based Dataset Search | 2024 | CIDR | 5.2229324e-05
6,373 | DeepBase: Deep Inspection of Neural Networks | 2019 | SIGMOD | 5.0929326e-05
6,384 | A Demonstration of DBWipes: Clean as You Query | 2012 | VLDB | 5.0880333e-05
6,541 | ConnectorX: Accelerating Data Loading From Databases to Dataframes | 2022 | VLDB | 5.0216945e-05
6,556 | Demonstration of PI2: Interactive Visualization Interface Generation for SQL Analysis in Notebook | 2022 | SIGMOD | 5.0148305e-05
6,779 | Explaining Inference Queries with Bayesian Optimization | 2021 | VLDB | 4.9280116e-05
6,842 | Towards Democratizing Relational Data Visualization | 2019 | SIGMOD | 4.9103931e-05
6,907 | Continuous Prefetch for Interactive Data Applications | 2020 | VLDB | 4.8925595e-05
7,206 | HiFi: A Unified Architecture for High Fan-in Systems (System Demonstration) | 2004 | VLDB | 4.8008153e-05
7,491 | Saibot: A Differentially Private Data Search Platform | 2023 | VLDB | 4.7180617e-05
7,807 | Pollock: A Data Loading Benchmark | 2023 | VLDB | 4.6457732e-05
7,920 | JoinBoost: Grow Trees Over Normalized Data Using Only SQL | 2023 | VLDB | 4.6163888e-05
8,593 | Wisteria: Nurturing Scalable Data Cleaning Infrastructure | 2015 | VLDB | 4.4891474e-05
8,678 | Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment | 2019 | SIGMOD | 4.4702119e-05
8,853 | Complaint-Driven Training Data Debugging at Interactive Speeds | 2022 | SIGMOD | 4.4350727e-05
9,273 | ActiveDeeper: A Model-based Active Data Enrichment System | 2020 | VLDB | 4.3649603e-05
9,849 | Reptile: Aggregation-level Explanations for Hierarchical Data | 2022 | SIGMOD | 4.2721228e-05
9,968 | Please Don't Kill My Vibe: Empowering Agents with Data Flow Control | 2026 | CIDR | 4.1945683e-05
10,127 | Visualization-Oriented Progressive Time Series Transformation | 2026 | SIGMOD | 4.1945683e-05
10,496 | Physical Visualization Design: Decoupling Interface and System Design | 2025 | SIGMOD | 4.1945683e-05
10,725 | Suna: Scalable Causal Confounder Discovery over Relational Data | 2025 | VLDB | 4.1945683e-05

Frequent Co-authors

Co-authored at least 5 papers.

Co-author | Shared Papers | Rank | Pagerank
Jiannan Wang | 13 | 144 | 0.3101392
Samuel R. Madden | 12 | 1 | 1.3916842
Michael J. Franklin | 7 | 11 | 0.94105828
Lampros Flokas | 6 | 2,048 | 0.036760095
Weiyuan Wu | 5 | 1,252 | 0.054955674
Zezhou Huang | 5 | 1,822 | 0.040379447