Pelegant stands for ``parallel elegant,'' which is a parallelized
version of elegant [1]. Written in the C programming
language with MPICH, the Pelegant has been successfully ported to
several clusters and supercomputers, such as the ``weed'' cluster (a
heterogeneous system of 100 CPUs) at Advanced Photon Source (APS),
and the Jazz cluster (350 Intel
Xeon CPUs) at Argonne National Lab (ANL) and the BlueGene/L supercomputer
(1024 dual PowerPC 440 nodes) at Argonne National Lab. Thanks to
careful design in parallelization and good architecture of the serial
elegant, the Pelegant achieves very good performance. For example, for a
simulation of
particles in APS including symplectic
element-by-element tracking, accelerating cavities, and crab cavities,
the simulation time was reduced from 14.3 days to 42 minutes on 512
CPUs of the BlueGene/L (BG/L) supercomputer. The speedup for this
particular simulation is 484 with efficiency near 95%.
This document describes how to build Pelegant, run the code and optimize the performance. Finally, appendices are included describing which elements have been parallelized and which commands have been used in the regression tests. The user should be familiar with the User's Manual for elegant before reading this document.