Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published June 2012 | public
Book Section - Chapter

FlashBlades: System architecture and applications

Abstract

Presented is a speculative server blade architecture called a FlashBlade that combines 100x I/O performance in both latency and bandwidth with balanced computing. The blade consists of a standard multi-core CPU with attached DRAM. It uses a fast interconnect, such as Intel's QuickPath, to communicate with a FPGA router called the X1. This router handles traffic to the C1 complexes and off-blade. Each C1 complex is a System on a Chip with Package on Package DRAM, connected to local flash memory. There are numerous complexes, giving tremendous I/O performance and computational balance. A large design space of parameters such as flash size, number of complexes, and link bandwidth between each C1 and the X1 is available for power and performance optimization. A single blade server constructed from these blades, just 12.25 inches high and drawing about 10 KW, could support a few hundred thousand basic web searches a second on 1 billion pages. It could also provide triple store performance 100x greater than achievable now for datasets of 6 TB and scales to petabyte datasets although at somewhat reduced performance; with numerous applications to defense, commerce, and science.

Additional Information

© 2012 ACM. The author is grateful to Allan Cantle of Nallatech, Peter Feeley of Micron, John McCorquodale of Caltech, and Prasanna Sundararajan of Xilinx, for many useful discussions particularly related to the issues of viability. This work was supported by the Department of Energy National Nuclear Security Administration under Award Number DEFC52-08NA28613.

Additional details

Created:
August 19, 2023
Modified:
October 20, 2023