Back to authors
Reynold Xin
- Author ID
- 1795
- ORCID
-
-
- Links
-
(found by gpt-5.2 on feb 09 2026)
- Most Frequent Institution
- Databricks
- Pagerank
- 0.21746365
- Overall Rank
- 239 | 98.88%
- Paper Count
- 23
Affiliation Timeline
Incoming Non-self Citations Over Time
Total yearly non-self incoming citations across all papers by this author.
Publications by Paper Pagerank
Showing 23 of 23 publications.
| Rank |
Title |
Year |
Venue |
Pagerank |
| 66 |
Spark SQL: Relational Data Processing in Spark |
2015 |
SIGMOD |
0.00061639801 |
| 94 |
CrowdDB: Answering Queries with Crowdsourcing |
2011 |
SIGMOD |
0.00051013264 |
| 542 |
Shark: SQL and Rich Analytics at Scale |
2013 |
SIGMOD |
0.00020595648 |
| 746 |
Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores |
2020 |
VLDB |
0.00017326979 |
| 818 |
Finding Related Tables |
2012 |
SIGMOD |
0.00016311524 |
| 1,377 |
Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics |
2021 |
CIDR |
0.00012296941 |
| 1,477 |
Fine-grained Partitioning for Aggressive Data Skipping |
2014 |
SIGMOD |
0.00011770865 |
| 1,548 |
Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark |
2018 |
SIGMOD |
0.00011431383 |
| 1,885 |
CrowdDB: Query Processing with the VLDB Crowd |
2011 |
VLDB |
0.0001021098 |
| 2,473 |
Photon: A Fast Query Engine for Lakehouse Systems |
2022 |
SIGMOD |
8.7237281e-05 |
| 2,488 |
Shark: Fast Data Analysis Using Coarse-grained Distributed Memory |
2012 |
SIGMOD |
8.6683713e-05 |
| 3,535 |
Scaling Spark in the Real World: Performance and Usability |
2015 |
VLDB |
6.9992495e-05 |
| 6,784 |
SparkR: Scaling R Programs with Spark |
2016 |
SIGMOD |
4.9265155e-05 |
| 7,059 |
Adaptive and Robust Query Execution for Lakehouses at Scale |
2024 |
VLDB |
4.8477825e-05 |
| 8,608 |
Unity Catalog: Open and Universal Governance for the Lakehouse and Beyond |
2025 |
SIGMOD |
4.4853979e-05 |
| 9,016 |
Making Data Engineering Declarative |
2023 |
CIDR |
4.4094312e-05 |
| 9,093 |
Databricks Lakeguard: Supporting Fine-grained Access Control and Multi-user Capabilities for Apache Spark Workloads |
2025 |
SIGMOD |
4.398149e-05 |
| 9,584 |
Introduction to Spark 2.0 for Database Researchers |
2016 |
SIGMOD |
4.3218691e-05 |
| 9,860 |
MEET DB2: Automated Database Migration Evaluation |
2010 |
VLDB |
4.269353e-05 |
| 11,993 |
A Partitioning Framework for Aggressive Data Skipping |
2014 |
VLDB |
4.1945683e-05 |
| 12,321 |
Linkage Query Writer |
2009 |
VLDB |
4.1945683e-05 |
| 13,096 |
Blink Twice - Automatic Workload Pinning and Regression Detection for Versionless Apache Spark using Retries |
2025 |
SIGMOD |
- |
| 13,124 |
Delta Sharing: An Open Protocol for Cross-Platform Data Sharing |
2025 |
VLDB |
- |
Frequent Co-authors
Co-authored at least 5 papers.