tm-problems, smp-prob?

Charland, Denis Denis.Charland at nrc.ca
Wed Feb 26 09:38:47 EST 2003


> 
> - What does the following mean?
> 
> #################
> Error received by batch job output Feb 24 23:31 trailing.queue.e1279
> 
> mpiexec: Error: wait_tasks: tm_poll remote: tm: system error.
> 
> Asynchron communication in mpicall never finished or died
> 
> DDD [000] ERROR 04200: receive-timeout for IF 14 in DDD_IFAExchange
> DDD [000] ERROR 04201:   waiting for message (from proc 1, size 2080)
> #################
> 
> Any hints?

Stefan,

Verify that on every compute node, the following files are available in the
sbin directory of your OpenPBS installation:

   pbs_demux, pbs_iff and pbs_rcp
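
A minimal sketch of such a check, assuming Python is available on the compute
nodes. The function name missing_pbs_helpers and its sbin_dir argument are
made up for illustration and are not part of OpenPBS or mpiexec; only the
three file names come from the list above. It reports which of the files are
absent or not executable in the directory you pass in:

```python
import os

# The three helper programs named in this message.
REQUIRED = ("pbs_demux", "pbs_iff", "pbs_rcp")


def missing_pbs_helpers(sbin_dir, names=REQUIRED):
    """Return the names that are absent or not executable in sbin_dir.

    sbin_dir is whatever sbin directory your OpenPBS installation uses;
    this sketch does not try to guess it.
    """
    missing = []
    for name in names:
        path = os.path.join(sbin_dir, name)
        # A helper counts as present only if it is a regular file
        # that the current user is allowed to execute.
        if not (os.path.isfile(path) and os.access(path, os.X_OK)):
            missing.append(name)
    return missing
```

Run it on each compute node with that node's sbin path. An empty list means
all three files were found; any name returned is worth checking on that node.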

Regards,

Denis Charland
National Research Council of Canada


