Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published January 2011 | public
Journal Article

MonALISA-based Grid monitoring and control

Abstract

High-Energy Physics experiments like ALICE at LHC require petabytes of storage and thousand of CPU working in parallel to store, reconstruct and analyze the collected data. This computing power is provided by aggregating the resources of hundreds of institutes and research centers and in addition several purpose-built large computing centers. All these resources are transparently available to the users under the umbrella of ALICE Grid. To ensure smooth operation of this complex distributed machinery we have developed a set of tools to monitor and control the various services, based on the MonALISA monitoring framework. By integrating monitoring information in the system we have achieved a high degree of automation and have significantly reduced the burden on the Grid managers. In this article we present how we collect the monitoring information and a few of the tools that make use of it.

Additional Information

© 2011 Società Italiana di Fisica; Springer-Verlag. Received: 30 October 2010; Revised: 19 December 2010 Published online: 21 January 2011.

Additional details

Created:
August 19, 2023
Modified:
October 23, 2023