mpiexec tm_poll error

Pete Wyckoff pw at osc.edu
Mon Jun 10 17:45:38 EDT 2002


parnell at msi.umn.edu said:
> I stubbled upon the answer (kinda funny).  I thought to myself well
> I wonder if PBSPro could operate with the pbs_demux of OpenPBS, perhaps
> that is the problem since it seems to always be this error "Connection 
> refused (111) in open_demux". Well why not give it a try, just backup 
> the Pro version and put in the Open version. Hmm, I say to myself, it's
> a little odd that there doesn't seem to be this pbs_demux on my client
> machines.  Apperently pbs_demux is not in the pbs-mom rpm (in theory
> the only one you would need on an execution host). I copied this over 
> to all the compute nodes and pow, works like a charm.
> 
> So, anyway, I thought you might like to know that it appears it will
> work fine with PBS Pro (though I suppose without some of the hacks), 
> I think the logs were strange enough that other folks might have the 
> same problem.

Not sure that I parse this:  you mean to say there is no pbs_demux
used in PBSPro?  And to get mpiexec to work with PBSPro it suffices
to copy the pbs_demux from the OpenPBS distribution onto your compute
nodes?  I am completely baffled... how could any tm_spawn-ed code
produce output in a PBSPro system without the demuxer?  Note that
mpiexec doesn't do a dang thing with pbs_demux, but perhaps it is the
tm_spawn code itself which is failing.  That would be a big bug with
the PBSPro distribution itself, no?

Interesting, at least.  Thanks for the note.

		-- Pete



More information about the mpiexec mailing list