mpiexec tm_poll error
Pete Wyckoff
pw at osc.edu
Mon Jun 10 17:45:38 EDT 2002
parnell at msi.umn.edu said:
> I stubbled upon the answer (kinda funny). I thought to myself well
> I wonder if PBSPro could operate with the pbs_demux of OpenPBS, perhaps
> that is the problem since it seems to always be this error "Connection
> refused (111) in open_demux". Well why not give it a try, just backup
> the Pro version and put in the Open version. Hmm, I say to myself, it's
> a little odd that there doesn't seem to be this pbs_demux on my client
> machines. Apperently pbs_demux is not in the pbs-mom rpm (in theory
> the only one you would need on an execution host). I copied this over
> to all the compute nodes and pow, works like a charm.
>
> So, anyway, I thought you might like to know that it appears it will
> work fine with PBS Pro (though I suppose without some of the hacks),
> I think the logs were strange enough that other folks might have the
> same problem.
Not sure that I parse this: you mean to say there is no pbs_demux
used in PBSPro? And to get mpiexec to work with PBSPro it suffices
to copy the pbs_demux from the OpenPBS distribution onto your compute
nodes? I am completely baffled... how could any tm_spawn-ed code
produce output in a PBSPro system without the demuxer? Note that
mpiexec doesn't do a dang thing with pbs_demux, but perhaps it is the
tm_spawn code itself which is failing. That would be a big bug with
the PBSPro distribution itself, no?
Interesting, at least. Thanks for the note.
-- Pete
More information about the mpiexec
mailing list