problems with voltaire mpi.

Roy Dragseth Roy.Dragseth at cc.uit.no
Fri Dec 1 18:23:17 EST 2006


Hi.

When I try to start an application linked with the voltaire infiniband 
using -comm=ib I get the following error:

Can't read SMP_ID

and if I use -comm=pmi i get 

Can't read MPIRUN_HOST

When I use the included mpirun from the voltaire stack it works as expected, 
also mpiexec works fine with mpich_chp4 applications.

Any obvious mistakes I'm making here?

Full output included below.

System info:

MPICH:
$ /usr/voltaire/mpi/bin/mpichversion
MPICH Version:          1.2.6
MPICH Release date:     $Date: 2004/09/20 12:35:29 $
MPICH Patches applied:  none
MPICH 
configure:        --with-device=vapi --with-arch=LINUX -prefix=/usr/voltaire/mpi.gcc.rsh --enable-sharedlib -lib=-Wl,-R/usr/voltaire/mpi.gcc.rsh/lib -Wl,-R/usr/voltaire/mpi.gcc.rsh/lib/shared -Wl,-R/usr/mellanox/lib -L/usr/mellanox/lib -L/usr/voltaire/mpi.gcc.rsh/lib -L/usr/voltaire/mpi.gcc.rsh/lib/shared -lmtl_common -lvapi -lmosal -lmpga -lpthread
MPICH Device:           vapi

OS:
RHEL 4, linux 2.6.9-34.

$ ../src/mpiexec-0.82/mpiexec -comm=ib  -v ./bandwidth.volt.x
mpiexec: resolve_exe: using absolute path "./bandwidth.volt.x".
Can't read SMP_ID
Can't read SMP_ID
mpiexec: process_start_event: evt 2 task 0 on compute-1-9.local.
mpiexec: process_start_event: evt 3 task 1 on compute-1-0.local.
mpiexec: All 2 tasks (spawn 0) started.
mpiexec: read_ib_startup_ports: waiting for checkin: 2 to accept, 0 to read.
mpiexec: process_obit_event: evt 4 task 0 on compute-1-9.local stat 1.
mpiexec: kill_tasks: killing all tasks.
mpiexec: wait_tasks: waiting for compute-1-0.local.
mpiexec: process_obit_event: evt 5 task 1 on compute-1-0.local stat 1.
mpiexec: process_kill_event: evt 6 task 1 on compute-1-0.local.
mpiexec: Warning: tasks 0-1 exited before completing MPI startup.

$ ../src/mpiexec-0.82/mpiexec -comm=pmi  -v ./bandwidth.volt.x
mpiexec: resolve_exe: using absolute path "./bandwidth.volt.x".
Can't read MPIRUN_HOST
Can't read MPIRUN_HOST
mpiexec: process_start_event: evt 2 task 0 on compute-1-9.local.
mpiexec: process_start_event: evt 3 task 1 on compute-1-0.local.
mpiexec: All 2 tasks (spawn 0) started.
mpiexec: wait_tasks: waiting for compute-1-9.local and 1 others.
mpiexec: process_obit_event: evt 4 task 0 on compute-1-9.local stat 1.
mpiexec: wait_tasks: waiting for compute-1-0.local.
mpiexec: process_obit_event: evt 5 task 1 on compute-1-0.local stat 1.
mpiexec: Warning: tasks 0-1 exited with status 1.

Best regards,
r.

-- 

  The Computer Center, University of Tromsø, N-9037 TROMSØ, Norway.
	      phone:+47 77 64 41 07, fax:+47 77 64 41 00
     Roy Dragseth, High Performance Computing System Administrator
	 Direct call: +47 77 64 62 56. email: royd at cc.uit.no


More information about the mpiexec mailing list