English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Conference Paper

Supporting Efficient Execution in Heterogeneous Distributed Computing Environments with Cactus and Globus

MPS-Authors

Allen,  Gabrielle
Cactus Group, AEI-Golm, MPI for Gravitational Physics, Max Planck Society;

Dramlitsch,  Thomas
Cactus Group, AEI-Golm, MPI for Gravitational Physics, Max Planck Society;

Seidel,  Edward
Cactus Group, AEI-Golm, MPI for Gravitational Physics, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

23676.pdf
(Publisher version), 203KB

Supplementary Material (public)
There is no public supplementary material available
Citation

Allen, G., Dramlitsch, T., Foster, I., Karonis, N. T., Ripeanu, M., Seidel, E., et al. (2001). Supporting Efficient Execution in Heterogeneous Distributed Computing Environments with Cactus and Globus.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-56BF-1
Abstract
Improvements in the performance of processors and networks make it both feasible and interesting to treat collections of workstations, servers, clusters, and supercomputers as integrated computational resources, or Grids. However, the highly heterogeneous and dynamic nature of such Grids can make application development di.cult. Here we describe an architecture and prototype implementation for a Grid-enabled computational framework based on Cactus, the MPICH-G2 Grid-enabled message-passing library, and a variety of specialized features to support e.cient execution in Grid environments. We have used this framework to perform record-setting computations in numerical relativity, running across four supercomputers and achieving scaling of 88% (1140 CPU’s) and 63% (1500 CPUs). The problem size we were able to compute was about .ve times larger than any other previous run. Further, we introduce and demonstrate adaptive methods that automatically adjust computational parameters during run time, to increase dramatically the e.ciency of a distributed Grid simulation, without modi.cation of the application and without any knowledge of the underlying network connecting the distributed computers.