core dump with gm-shared memory
Glen Beane
beaneg at umcs.maine.edu
Thu Jun 26 10:06:40 EDT 2003
If I build mpiexec to use gm-shmem on SMP nodes, mpiexec crashes with a
segmentation fault, but always after my MPI program has finished
properly, so it seems to happen while mpiexec is cleaning up.
If I build mpiexec without gm-shmem there are no problems.
gm-shmem has been changed slightly on my system. After discussing some
problems with Myricom, we decided to change the default location of the
shared memory file on our system (done by editing gmpi_smppriv.c and
mpirun.ch_gm.pl). Since /tmp was NFS mounted, we were having problems
with a large number of nodes writing shared memory files to /tmp. The
shared memory file is now located in a ramdisk (the location of the
shared memory file will likely be a configurable option in the next
MPICH/GM release).
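To illustrate the kind of change described above, here is a minimal
sketch of building the shared memory file path from a configurable
directory instead of a hard-coded /tmp. This is not the actual
MPICH/GM code from gmpi_smppriv.c: the environment variable
EXAMPLE_SHMEM_DIR, the default directory /dev/shm, the file name
pattern, and both function names are assumptions made up for the
example.

```c
#include <stdio.h>
#include <stdlib.h>

/* Assumed default: a RAM-backed directory rather than NFS-mounted /tmp. */
#define DEFAULT_SHMEM_DIR "/dev/shm"

/*
 * Return the directory for the shared memory file.
 * EXAMPLE_SHMEM_DIR is a hypothetical variable, not an MPICH/GM
 * or mpiexec option.
 */
const char *shmem_dir(void)
{
    const char *dir = getenv("EXAMPLE_SHMEM_DIR");
    return (dir != NULL && dir[0] != '\0') ? dir : DEFAULT_SHMEM_DIR;
}

/*
 * Write "<dir>/example_shmem_<job_id>.tmp" into buf.
 * The file name pattern is illustrative only.
 * Returns 0 on success, -1 if the path does not fit in buf.
 */
int shmem_file_path(char *buf, size_t len, const char *dir, long job_id)
{
    int n = snprintf(buf, len, "%s/example_shmem_%ld.tmp", dir, job_id);
    return (n < 0 || (size_t)n >= len) ? -1 : 0;
}
```

Any launcher that cleans up or references the file would have to
compute the same path, for example
shmem_file_path(buf, sizeof buf, shmem_dir(), job_id).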
This setup works fine with mpirun.ch_gm, but with mpiexec it has been
causing segmentation faults, which don't seem to affect the actual
MPI program.
Since mpirun.ch_gm.pl references the temp file, I was wondering whether
mpiexec does as well, but in a quick look through the source code I
didn't find any reference to it.
Does anyone know what might be causing the problem? Other than the
inability to use gm-shmem, we really like mpiexec so far.