Published 2011
| public
Journal Article
High Throughput WAN Data Transfer with Hadoop-based Storage
Chicago
Abstract
Hadoop distributed file system (HDFS) is becoming more popular in recent years as a key building block of integrated grid storage solution in the field of scientific computing. Wide Area Network (WAN) data transfer is one of the important data operations for large high energy physics experiments to manage, share and process datasets of PetaBytes scale in a highly distributed grid computing environment. In this paper, we present the experience of high throughput WAN data transfer with HDFS-based Storage Element. Two protocols, GridFTP and fast data transfer (FDT), are used to characterize the network performance of WAN data transfer.
Additional Information
© 2011 Institute of Physics. Published under licence by IOP Publishing Ltd.Additional details
- Eprint ID
- 30048
- Resolver ID
- CaltechAUTHORS:20120410-110725216
- Created
-
2012-04-10Created from EPrint's datestamp field
- Updated
-
2022-07-12Created from EPrint's last_modified field