Performance Analysis of Blue Gene/L Using Parallel Discrete Event Simulation
Abstract
High performance computers currently under construction, such as IBM's Blue Gene/L, consisting of large numbers (64K) of low cost processing elements with relatively small local memories (256MB) connected via relatively low bandwidth (0.375 Bytes/FLOP) low cost interconnection networks promise exceptional cost-performance for some scientific applications. Due to the large number of processing elements and adaptive routing networks in such systems, performance analysis of meaningful application kernels requires innovative methods. This paper describes a method that combines application analysis, tracing and parallel discrete event simulation to provide early performance prediction. Specifically, results of performance analysis of a Lennard-Jones Spatial (LJS) Decomposition molecular dynamics benchmark code for Blue Gene/L are given.
Additional Information
© 2003 Springer-Verlag Berlin Heidelberg. This work was performed under contract with Lawrence Livermore National Laboratory, Contract No. B520721 (Applications Requirements Machine Model Simulator).Additional details
- Eprint ID
- 99199
- Resolver ID
- CaltechAUTHORS:20191009-140018524
- B520721
- Livermore National Laboratory
- Created
-
2019-10-09Created from EPrint's datestamp field
- Updated
-
2021-11-16Created from EPrint's last_modified field
- Series Name
- Lecture Notes in Computer Science
- Series Volume or Issue Number
- 2913