A Hadoop Based Distributed Loading Approach to Parallel Data Warehouses
Summary: Hadoop as distributed ETL loader to Teradata EDW, leveraging HDFS for scalable, parallel loading. Polynomial-time optimal and approximate HDFS-block to Teradata-unit assignment minimizes network traffic; MapReduce enables transformation of un/ semi-structured data; experiments show gains. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yu Xu
- 2. Pekka Kostamaa
- 3. Yan Qi
- 4. Jian Wen
- 5. Kevin Keliang Zhao
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,977 | Split Query Processing in Polybase | 2013 | SIGMOD | 9.8824589e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
| 794 | Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) | 2010 | VLDB | 0.00016605103 |
| 1,863 | Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce | 2010 | VLDB | 0.00010286531 |
| 3,517 | Integrating Hadoop and Parallel DBMS | 2010 | SIGMOD | 7.0199423e-05 |
| 5,838 | HadoopDB in Action: Building Real World Applications | 2010 | SIGMOD | 5.3059032e-05 |
Previous
Page 1 / 1
Next