mpiexec svn and mvapich 0.97 failure to start jobs

Pete Wyckoff pw at osc.edu
Fri Jul 7 14:17:18 EDT 2006


jtang at tchpc.tcd.ie wrote on Fri, 07 Jul 2006 15:38 +0100:
> On Wed, Mar 15, 2006 at 04:55:22PM -0500, Pete Wyckoff wrote:
> > This "connect: Connection refused" message comes from the MPI task,
> > not from mpiexec.  I have seen this before, unfortunately.  Looking
> > at mvapich-0.9.7, it is clear that they included a misguided patch
> > from NERSC that the Mellanox people picked up.  I told the original
> > author and Mellanox that it was a bad idea, almost a year ago.
>
> the mvapich devs have pretty much removed the offending code from 0.9.8rc0
> and higher. I've compiled it up and tested both the 0.9.8rc0 and trunk
> branches, and mpiexec 0.80 works as expected. I would expect 0.81 would
> probably work as well, but I haven't tested it.
> 
> well, this was just to bump this thread with some updated information.

Whew, glad that is finally fixed.  Thanks for poking the OSU folks
and letting us know.  I updated the mpiexec web page and README.

		-- Pete

