Wierd GM/MPIexec error
Christopher D. Maestas
cdmaest at sandia.gov
Wed Aug 13 23:36:23 EDT 2003
On Wed, 13 Aug 2003, Pete Wyckoff wrote:
> cdmaest at sandia.gov said on Wed, 13 Aug 2003 14:12 -0600:
> > Has anyone seen the following occur?
> >
> > ---
> > mpiexec - Version 0.74, configure options: --prefix=/apps/mpiexec --with-pbs=/apps/openpbs/ --with-mpicc=mpicc --with-default-comm=mpich-gm
> > ---
> > mpich-gm - 1.2.5..10
> > ---
> >
> > ===
> > [cn1 examples]$ mpiexec -np 1 -v cpi
> > resolve_exe: prefixing dot to executable: "./cpi"
> > node 0: name = cn1, mpname = cn1, cpu = 1
> > wait_one_task_start: evt = 2, task 0 host cn1
> > All 1 task started.
> > read_gm_startup_ports: waiting for info
> > read_gm_startup_ports: mpich gm version 12510
> > read_gm_startup_ports: id 0 port 2 board 0 gm_node_id 2147483647
> > numanode 0 pid 8992 remote_port 8000
> > [0] Error: Unable to translate GM global node id (2147483647)to local node id for the MPI id
> > 0 !
> > mpiexec: Warning: accept_abort_conn: MPI_Abort from IP 172.30.1.7, killing all.
> > wait_tasks: got evt 0, did not match any
> > wait_tasks: numspawned = 1, got evt 4 for tid 3 host cn1 status 0
> > ===
>
> It looks like GM is reporting that the node id is -1. It gives that to
> mpiexec, which dutifully returns it back to the process, which then
> complains that the number is not found. All else looks okay.
>
> If you do:
> gm_board_info | grep "This is"
> do you see something other than 2147483647 for the node id?
> Can you get the code to run using mpirun?
>
> -- Pete
>
---
[cn2 examples]$ /apps/gm/bin/gm_board_info | grep "This is"
[cn2 examples]$ /apps/gm/bin/gm_board_info | grep "this"
0: BUSY 21188 (this process [gm_board_info])
Route table for this node follows:
1 00:60:dd:49:82:ab cn2 (this node)
[hpsupt at cn2 examples]$ /apps/gm/bin/gm_board_info | grep "his"
This node is "cn2"
0: BUSY 21190 (this process [gm_board_info])
Route table for this node follows:
1 00:60:dd:49:82:ab cn2 (this node)
---
I guess I forgot that this is against gm 2.0.5 as well. :-)
I can get the code to work with mpirun ...
-- Chris
More information about the mpiexec
mailing list