Weird GM/MPIexec error

Pete Wyckoff pw at osc.edu
Wed Aug 13 17:22:51 EDT 2003


cdmaest at sandia.gov said on Wed, 13 Aug 2003 14:12 -0600:
> Has anyone seen the following occur? 
> 
> ---
> mpiexec - Version 0.74, configure options: --prefix=/apps/mpiexec --with-pbs=/apps/openpbs/ --with-mpicc=mpicc --with-default-comm=mpich-gm
> ---
> mpich-gm - 1.2.5..10
> ---
> 
> ===
> [cn1 examples]$ mpiexec -np 1 -v cpi
> resolve_exe: prefixing dot to executable: "./cpi"
> node  0: name = cn1, mpname = cn1, cpu = 1
> wait_one_task_start: evt = 2, task 0 host cn1
> All 1 task started.
> read_gm_startup_ports: waiting for info
> read_gm_startup_ports: mpich gm version 12510
> read_gm_startup_ports: id 0 port 2 board 0 gm_node_id 2147483647
>   numanode 0 pid  8992 remote_port  8000
> [0] Error: Unable to translate GM global node id (2147483647)to local node id for the MPI id
> 0 !
> mpiexec: Warning: accept_abort_conn: MPI_Abort from IP 172.30.1.7, killing all.
> wait_tasks: got evt 0, did not match any
> wait_tasks: numspawned = 1, got evt 4 for tid 3 host cn1 status 0
> ===

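It looks like GM is reporting an invalid node id.  2147483647 is the
largest signed 32-bit value, so it is most likely a placeholder rather
than a real id.  It gives that to mpiexec, which dutifully returns it
back to the process, which then complains that the number is not found.
All else looks okay.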
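
As a quick sanity check on the value in the log, here is a small Python
sketch showing how 2147483647 reads as a 32-bit integer.  It is not part
of mpiexec or GM, and the variable names are only illustrative.  The
number is the largest signed 32-bit value, not the bit pattern of -1.
That would fit a "no id assigned" placeholder, but that reading is an
assumption, not something taken from the GM sources.

```python
import struct

# gm_node_id as printed by read_gm_startup_ports in the log above
reported = 2147483647

# Reinterpret the same 32 bits as a signed integer.
as_signed = struct.unpack("<i", struct.pack("<I", reported))[0]

# For comparison: what -1 looks like when read as unsigned 32-bit.
minus_one_unsigned = struct.unpack("<I", struct.pack("<i", -1))[0]

print(hex(reported))           # 0x7fffffff
print(reported == 2**31 - 1)   # True
print(as_signed)               # 2147483647
print(minus_one_unsigned)      # 4294967295
```

Either way the value is not a usable node id, so the checks below still
apply.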

If you do:
	gm_board_info | grep "This is"
do you see something other than 2147483647 for the node id?
Can you get the code to run using mpirun?

		-- Pete


