Efficient Bulk Insertion into a Distributed Ordered Table
Summary: Pre-insertion planning creates and redistributes partitions to balance bulk inserts across a distributed ordered table. Frames the load-balancing as a partition-movement vs insertion-time tradeoff, reduced to a vector-packing variant of NP-hard bin-packing with guarantees; validated on a 50-node cluster. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Adam Silberstein
- 2. Brian F. Cooper
- 3. Utkarsh Srivastava
- 4. Erik Vee
- 5. Ramana Yerneni
- 6. Raghu Ramakrishnan
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 53 | PNUTS: Yahoo!'s Hosted Data Serving Platform | 2008 | VLDB | 0.00066144767 |
| 1,985 | A Practical Scalable Distributed B-Tree | 2008 | VLDB | 9.8569956e-05 |
| 4,857 | The "Big Data" Ecosystem at LinkedIn | 2013 | SIGMOD | 5.8736144e-05 |
| 9,349 | A Framework for Supporting DBMS-like Indexes in the Cloud | 2011 | VLDB | 4.3526413e-05 |
| 9,419 | DITIR: Distributed Index for High Throughput Trajectory Insertion and Real-time Temporal Range Query | 2017 | VLDB | 4.3441378e-05 |
| 9,574 | PNUTS to Sherpa: Lessons from Yahoo!'s Cloud Database | 2019 | VLDB | 4.3252874e-05 |
| 12,226 | Indexing Multi-dimensional Data in a Cloud System | 2010 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 53 | PNUTS: Yahoo!'s Hosted Data Serving Platform | 2008 | VLDB | 0.00066144767 |
| 875 | Algorithms for Creating Indexes for Very Large Tables Without Quiescing Updates | 1992 | SIGMOD | 0.00015719411 |
| 1,077 | Incremental Organization for Data Recording and Warehousing | 1997 | VLDB | 0.00014247204 |
| 2,136 | A Generic Approach to Bulk Loading Multidimensional Index Structures | 1997 | VLDB | 9.4721139e-05 |
| 4,756 | OODB Bulk Loading Revisited: The Partitioned-List Approach | 1995 | VLDB | 5.9450079e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,477 | Fine-grained Partitioning for Aggressive Data Skipping | 2014 | SIGMOD | 0.00011770865 |
| 9,918 | Shared Load(ing): Efficient Bulk Loading into Optimized Storage | 2020 | CIDR | 4.2561557e-05 |
| 8,150 | Parallelism-Optimizing Data Placement for Faster Data-Parallel Computations | 2023 | VLDB | 4.5746638e-05 |
| 2,413 | Automated Partitioning Design in Parallel Database Systems | 2011 | SIGMOD | 8.8672223e-05 |
| 6,694 | Optimal Splitters for Temporal and Multi-version Databases | 2013 | SIGMOD | 4.9586454e-05 |
| 285 | Automating Physical Database Design in a Parallel Database | 2002 | SIGMOD | 0.0002899128 |
| 679 | Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems | 2012 | SIGMOD | 0.00018215154 |
| 7,913 | Resource Bricolage for Parallel Database Systems | 2015 | VLDB | 4.6180739e-05 |
| 1,347 | Online Balancing of Range-Partitioned Data with Applications to Peer-to-Peer Systems | 2004 | VLDB | 0.00012456657 |
| 12,330 | Adaptively Parallelizing Distributed Range Queries | 2009 | VLDB | 4.1945683e-05 |