What does this error mean?
Thomas Zeiser
thomas.zeiser at rrze.uni-erlangen.de
Wed Jan 30 16:21:27 EST 2008
Hi Andrey,
hi Kevin,
On Wed, Jan 30, 2008 at 07:17:36PM +0300, Derbunovich, Andrey wrote:
> Kevin,
>
> Thank you. Actually we never test our library with this mpiexec version.
we have Intel MPI 3.1.026 successfully running with torque-2.1.x
and Pete's mpiexec-0.82
(there are only sometimes timeout problem if quite large Infiniband
jobs start - but that's an other storry)
> Did you able to use this mpiexec with earlier versions of the Intel MPI
> Library?
>
> Best regards,
> Andrey
> > > > > > mpiexec: read_keyvals: keyval 1 key kvsname val
> > 53001.jman-spawn-0.
> > > > > > mpiexec: read_keyvals: keyval 2 key key val DAPL_PROVIDER.
> > > > > > mpiexec: read_keyvals: keyval 3 key value val <NULL.
Kevin: what interconnect are you trying to use?
as "DAPL_PROVIDER" appears here, it looks for me that you use
Infiniband or Myrinet and not plain ethernet - thus giving
additional potential for failures.
Can you check if your job starts fine if you force Intel MPI to the
socks device by "export I_MPI_DEVICE=ssm".
thomas
--
Thomas Zeiser, HPC Services
Friedrich-Alexander-Universitaet Erlangen-Nuernberg
Regionales Rechenzentrum Erlangen (RRZE)
Martensstrasse 1, 91058 Erlangen, Germany
Tel.: +49 9131 85-28737, Fax: +49 9131 302941
thomas.zeiser at rrze.uni-erlangen.de
http://www.rrze.uni-erlangen.de/hpc/
More information about the mpiexec
mailing list