mpiexec tm_poll error

Stefan Parnell parnell at msi.umn.edu
Mon Jun 10 17:50:37 EDT 2002


Sorry, let me clarify, there was no pbs_demux in the "pbs-mom" rpm
package of PBSPro, in theory a seperate rpm you can install on
execution nodes where you don't need the entire distribution.  Our
University has had an issue with the contract for the source dist
of PBSPro so we've relied upon the binary dists.  I found pbs_demux
on the server (it had the full dist of PBSPro) and copied this over 
to all the execution nodes.  

Stefan

In reply to Pete Wyckoff (pw at osc.edu):

> parnell at msi.umn.edu said:
> > I stubbled upon the answer (kinda funny).  I thought to myself well
> > I wonder if PBSPro could operate with the pbs_demux of OpenPBS, perhaps
> > that is the problem since it seems to always be this error "Connection 
> > refused (111) in open_demux". Well why not give it a try, just backup 
> > the Pro version and put in the Open version. Hmm, I say to myself, it's
> > a little odd that there doesn't seem to be this pbs_demux on my client
> > machines.  Apperently pbs_demux is not in the pbs-mom rpm (in theory
> > the only one you would need on an execution host). I copied this over 
> > to all the compute nodes and pow, works like a charm.
> > 
> > So, anyway, I thought you might like to know that it appears it will
> > work fine with PBS Pro (though I suppose without some of the hacks), 
> > I think the logs were strange enough that other folks might have the 
> > same problem.
> 
> Not sure that I parse this:  you mean to say there is no pbs_demux
> used in PBSPro?  And to get mpiexec to work with PBSPro it suffices
> to copy the pbs_demux from the OpenPBS distribution onto your compute
> nodes?  I am completely baffled... how could any tm_spawn-ed code
> produce output in a PBSPro system without the demuxer?  Note that
> mpiexec doesn't do a dang thing with pbs_demux, but perhaps it is the
> tm_spawn code itself which is failing.  That would be a big bug with
> the PBSPro distribution itself, no?
> 
> Interesting, at least.  Thanks for the note.
> 
> 		-- Pete

-- 
Stefan Parnell             <parnell at msi.umn.edu>
UNIX Systems Administrator,
University of Minnesota 
Supercomputing Institute for Digital Simulation and Advanced Computation
--



More information about the mpiexec mailing list