tm-problems, smp-prob?
Charland, Denis
Denis.Charland at nrc.ca
Wed Feb 26 09:38:47 EST 2003
>
> - what means the following??:
>
> #################
> Error received by batch job output Feb 24 23:31 trailing.queue.e1279
>
> mpiexec: Error: wait_tasks: tm_poll remote: tm: system error.
>
> Asynchron communication in mpicall never finished or died
>
> DDD [000] ERROR 04200: receive-timeout for IF 14 in DDD_IFAExchange
> DDD [000] ERROR 04201: waiting for message (from proc 1, size 2080)
> #################
>
> Any hints?
Stefan,
Verify that on every compute node, the following files are available in the
sbin directory of your OpenPBS installation:
pbs_demux, pbs_iff and pbs_rcp
Regards,
Denis Charland
National Research Council of Canada
More information about the mpiexec
mailing list