Capturing return values from concurrent mpiexecs
Martin Schafföner
martin.schaffoener at e-technik.uni-magdeburg.de
Tue Nov 8 14:50:54 EST 2005
On Tuesday 08 November 2005 20:15, Pete Wyckoff wrote:
> What you describe wanting is how I thought it worked now, but
> testing shows it does not. Did it ever work in an earlier version?
> Are you using 0.80 now? I'll play with it and see if I can
Don't know if it worked before; I only started using mpiexec now (0.80) to
replace all rsh-related stuff on our nodes.
> I'd imagine that "mpiexec -server" would always return 0 unless it
> died in some unnatural way---it would never return the exit status
> of tasks it started for any of its clients.
If I send SIGTERM to "mpiexec -server", it returns 1.
> The combined return value of a parallel task is always just the exit
> status of task #0, although there will be warning lines on stderr to
> report the non-zero exit statuses of other tasks. I couldn't come
> up with any better way of reporting the array of exit statuses. Any
> suggestions?
Returning only the exit status of task #0 and warning about the others is
fine, I think. But if I start, say, 8 concurrent "mpiexec -n x" while
"mpiexec -server" is running, I would like to get 8 return values, one per
mpiexec.
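To make the expectation above concrete, here is a rough sketch of the kind of driver I have in mind. It is only an illustration, not part of mpiexec: the helper name run_concurrently, the program name ./my_mpi_program and the task count 4 are made up for the example, and it assumes "mpiexec -server" is already running.

```python
import subprocess
import sys


def run_concurrently(commands):
    """Start all commands at once and return their exit statuses in order."""
    # Popen returns immediately, so every command is running before we wait.
    procs = [subprocess.Popen(cmd) for cmd in commands]
    # wait() blocks until that child has exited and returns its exit status.
    statuses = [p.wait() for p in procs]
    for index, status in enumerate(statuses):
        if status != 0:
            print(f"command #{index} exited with status {status}", file=sys.stderr)
    return statuses


# Hypothetical usage (placeholder program name and task count):
#   jobs = [["mpiexec", "-n", "4", "./my_mpi_program"] for _ in range(8)]
#   statuses = run_concurrently(jobs)   # hoped for: one status per mpiexec
```

With 8 entries in jobs, I would expect a list of 8 statuses, each reflecting the exit status of the corresponding parallel job rather than that of the server.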
I have also been tinkering with the source code, adding trace stuff here and
there, but have not found the cause of the failure.
Regards,
--
Martin Schafföner
Cognitive Systems Group, Institute of Electronics, Signal Processing and
Communication Technologies, Department of Electrical Engineering,
Otto-von-Guericke University Magdeburg
Phone: +49 391 6720063