Weird GM/MPIexec error

Christopher D. Maestas cdmaest at sandia.gov
Wed Aug 13 23:36:23 EDT 2003


On Wed, 13 Aug 2003, Pete Wyckoff wrote:

> cdmaest at sandia.gov said on Wed, 13 Aug 2003 14:12 -0600:
> > Has anyone seen the following occur? 
> > 
> > ---
> > mpiexec - Version 0.74, configure options: --prefix=/apps/mpiexec --with-pbs=/apps/openpbs/ --with-mpicc=mpicc --with-default-comm=mpich-gm
> > ---
> > mpich-gm - 1.2.5..10
> > ---
> > 
> > ===
> > [cn1 examples]$ mpiexec -np 1 -v cpi
> > resolve_exe: prefixing dot to executable: "./cpi"
> > node  0: name = cn1, mpname = cn1, cpu = 1
> > wait_one_task_start: evt = 2, task 0 host cn1
> > All 1 task started.
> > read_gm_startup_ports: waiting for info
> > read_gm_startup_ports: mpich gm version 12510
> > read_gm_startup_ports: id 0 port 2 board 0 gm_node_id 2147483647
> >   numanode 0 pid  8992 remote_port  8000
> > [0] Error: Unable to translate GM global node id (2147483647)to local node id for the MPI id 0 !
> > mpiexec: Warning: accept_abort_conn: MPI_Abort from IP 172.30.1.7, killing all.
> > wait_tasks: got evt 0, did not match any
> > wait_tasks: numspawned = 1, got evt 4 for tid 3 host cn1 status 0
> > ===
> 
> It looks like GM is reporting that the node id is -1.  It gives that to
> mpiexec, which dutifully returns it back to the process, which then
> complains that the number is not found.  All else looks okay.
> 
> If you do:
> 	gm_board_info | grep "This is"
> do you see something other than 2147483647 for the node id?
> Can you get the code to run using mpirun?
> 
> 		-- Pete
> 
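For reference, a quick check of the number in the log above. This is only a sketch: the helper name `looks_unset` is hypothetical, and treating this value as an "unset" marker is an assumption rather than anything confirmed by GM or mpiexec documentation. What the arithmetic does show is that 2147483647 is the largest signed 32-bit integer (0x7fffffff), not a literal -1, which would be 4294967295 when read as an unsigned 32-bit value.

```python
# Sketch only: sanity-check the node id that mpiexec printed.
INT32_MAX = 2**31 - 1          # largest signed 32-bit integer
UINT32_MINUS_ONE = 2**32 - 1   # what -1 looks like as an unsigned 32-bit value

reported = 2147483647          # gm_node_id from the read_gm_startup_ports line


def looks_unset(node_id):
    """Hypothetical helper: flag ids that look like a placeholder.

    Assumption: INT32_MAX, unsigned -1, or a negative value means
    "no real node id was filled in". This is not a documented GM rule.
    """
    return node_id < 0 or node_id in (INT32_MAX, UINT32_MINUS_ONE)


print(hex(reported), reported == INT32_MAX, looks_unset(reported))
```

A small id such as 1 would not be flagged by this check.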

---
[cn2 examples]$ /apps/gm/bin/gm_board_info | grep "This is"
[cn2 examples]$ /apps/gm/bin/gm_board_info | grep "this"
   0:   BUSY 21188  (this process [gm_board_info])
Route table for this node follows:
   1 00:60:dd:49:82:ab                              cn2 (this node)
[hpsupt at cn2 examples]$ /apps/gm/bin/gm_board_info | grep "his"
This node is "cn2"
   0:   BUSY 21190  (this process [gm_board_info])
Route table for this node follows:
   1 00:60:dd:49:82:ab                              cn2 (this node)
---
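Since this GM version prints `This node is "cn2"` rather than a "This is" line, the route table row marked "(this node)" is the easier thing to match. A minimal sketch of pulling it out, assuming the output layout is exactly what is pasted above and that the first column of that row is the node id (both are assumptions; `find_this_node` is a hypothetical helper, not part of GM or mpiexec):

```python
import re

# Sample text copied from the gm_board_info output shown above.
SAMPLE = '''This node is "cn2"
   0:   BUSY 21190  (this process [gm_board_info])
Route table for this node follows:
   1 00:60:dd:49:82:ab                              cn2 (this node)
'''

# Row layout assumed from the sample: <id> <MAC address> <hostname> (this node)
ROW = re.compile(
    r"^\s*(\d+)\s+((?:[0-9a-fA-F]{2}:){5}[0-9a-fA-F]{2})\s+(\S+)\s+\(this node\)\s*$"
)


def find_this_node(text):
    """Return (id, mac, hostname) for the row marked '(this node)', or None."""
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            return int(match.group(1)), match.group(2), match.group(3)
    return None


print(find_this_node(SAMPLE))
```

On the sample this yields id 1 for cn2, which is what one would compare against the 2147483647 that mpiexec reported.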

I forgot to mention that this is built against GM 2.0.5 as well. :-)

I can get the code to work with mpirun ... 

-- Chris 




