Wide Table Layout Optimization based on Column Ordering and Duplication
Summary: This paper addresses wide-table layout for column stores on HDFS by jointly optimizing column ordering and column duplication under a fine-grained I/O cost model. Its workload-driven algorithms output a column order, optionally with duplicated columns, that minimizes disk seeks and query contention. The approach is validated experimentally on data and query workloads. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
1. Haoqiong Bian
2. Ying Yan
3. Wenbo Tao
4. Liang Jeff Chen
5. Yueguo Chen
6. Xiaoyong Du
7. Thomas Moscibroda
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 5,441 | Using Cloud Functions as Accelerator for Elastic Data Analytics | 2023 | SIGMOD | 5.5028093e-05 |
| 10,117 | AixelAsk: A Stepwise-Guided Retrieval and Reasoning Framework for Large Table QA | 2026 | SIGMOD | 4.1945683e-05 |
| 11,067 | Partition, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines | 2024 | VLDB | 4.1945683e-05 |
| 11,175 | Grouping Time Series for Efficient Columnar Storage | 2023 | SIGMOD | 4.1945683e-05 |
| 11,545 | Pixels: Multiversion Wide Table Store for Data Lakes | 2020 | CIDR | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 17 of 17 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,014 | Dynamically Optimizing Queries over Large Scale Data Platforms | 2014 | SIGMOD | 5.7586174e-05 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 1,949 | Positional Update Handling in Column Stores | 2010 | SIGMOD | 9.9864085e-05 |
| 5,465 | Workload-Aware Storage Layout for Database Systems | 2010 | SIGMOD | 5.4919488e-05 |
| 710 | Performance Tradeoffs in Read-Optimized Databases | 2006 | VLDB | 0.00017765454 |
| 5,898 | Column Partition and Permutation for Run Length Encoding in Columnar Databases | 2020 | SIGMOD | 5.2839046e-05 |
| 3,737 | Skipping-oriented Partitioning for Columnar Layouts | 2017 | VLDB | 6.8033227e-05 |
| 3,488 | Optimal Column Layout for Hybrid Workloads | 2019 | VLDB | 7.0479329e-05 |
| 3,208 | Column-Oriented Storage Techniques for MapReduce | 2011 | VLDB | 7.3781897e-05 |
| 6,802 | Understanding Insights into the Basic Structure and Essential Issues of Table Placement Methods in Clusters | 2013 | VLDB | 4.9226626e-05 |