Mpiexec release 0.75, InfiniBand support, restart PBS mom, more
Pete Wyckoff
pw at osc.edu
Fri Dec 5 17:05:51 EST 2003
Changes from the previous version are quite extensive, as it has been
seven months since the last release. If any of the following topics
interest you, please give the new version a try.
New communication library: MPICH on InfiniBand
InfiniBand is a high-speed interconnect that is becoming popular in
the message passing world. The most popular implementation of MPI
is the one from OSU/CIS based on MPICH and supported in this release
of mpiexec.
PBS Mom restart
Add support to reconnect to PBS moms which are restarted during the
run of a parallel application. This requires changes to PBS to work
properly which are included in a new patch to OpenPBS found in this
mpiexec distribution.
It is a somewhat complex patch which fixes numerous crashes in the
PBS code itself. This support is still marked experimental, but
give it a shot if you are interested in the ability to restart moms
under running parallel processes.
MPIEXEC_RANK environment variable
Many users take advantage of the "none" communication library to
automate system tasks in the context of a PBS job. There is now an
environment variable which gives a different number to each task
similar to the rank in an MPI implementation. Thanks to Jose Luis
Gordillo Ruiz and Eduardo Murrieta Leon for the idea and patch.
MPICH/P4 debugging fix
Process arguments were moved around to allow debugging inside an
xterm. This always had worked for the other communication
libraries. Try "mpiexec xterm -e gdb --args mycode" to see it in
action.
MPICH/P4 shmem command-line flag
In the ongoing saga of the "--with-comm=shared" compile-time flag
for the MPICH/P4 libmpich.a library itself, this adds a bit more
flexibility. Now mpiexec allows runtime specification of the
shared-memory support of your mpich library. You will be much
happier if you compile mpiexec so that it knows if your mpich/p4
library uses --comm=shmem or not, but this flag is handy in testing,
or at sites which must support both types.
MPICH/GM GM2 bug fix
Chris Maestas of Sandia found and fixed a problem when using mpiexec
with MPICH/GM with the GM2 library. This GM2 library is a major
version change from the GM library most use now to drive their
Myricom devices, and thus not yet widely adopted.
Full changelog and downloads at: http://www.osc.edu/~pw/mpiexec/
Respond to the list with bug reports, comments, suggestions,
and complaints.
-- Pete
More information about the mpiexec
mailing list