mpiexec & PBS Professional 7.1: "PBS reports fewer hosts than TM"
Thomas Zeiser
thomas.zeiser at rrze.uni-erlangen.de
Fri Mar 31 14:29:17 EST 2006
Dear all,
since upgrading from PBS Professional 7.0 to 7.1, we get the
following error message when starting jobs with mpiexec:
/opt/mpiexec-0.80/bin/mpiexec -n 2 -comm none hostname
mpiexec: Error: get_hosts: PBS reports fewer hosts 1 than TM 2.
Recompiling mpiexec against the updated PBS libraries and include files
does not help. The machine is an SGI Altix (IA64) with SuSE
SLES9SP3/ProPack4. With both PBS Professional versions we use
pbs_mom with cpusets.
The PBS output itself seems reasonable:
qsub -I -lncpus=2
qstat -an 74842.altix
Job ID          Username Queue    Jobname    SessID NDS TSK Memory Time  S Time
--------------- -------- -------- ---------- ------ --- --- ------ ----- - -----
74842.altix     unrz143  parallel STDIN       20219  --   2     -- 00:30 R 00:02
   altix:ssinodes=1:mem=3987456kb:ncpus=2
env | grep PBS
PBS_O_HOME=/home/unrz/unrz143
PBS_O_LANG=en_US.UTF-8
PBS_O_LOGNAME=unrz143
PBS_O_PATH=....
PBS_O_MAIL=/mail/unrz143
PBS_O_SHELL=/bin/tcsh
PBS_O_HOST=altix
PBS_O_WORKDIR=/home/unrz/unrz143
PBS_O_SYSTEM=Linux
PBS_O_QUEUE=router
PBS_JOBNAME=STDIN
PBS_JOBID=74842.altix
PBS_QUEUE=parallel
PBS_JOBCOOKIE=6EA777E344AE7098
PBS_NODENUM=0
PBS_TASKNUM=1
PBS_MOMPORT=15003
PBS_NODEFILE=/var/spool/PBS/aux/74842.altix
PBS_ENVIRONMENT=PBS_INTERACTIVE
cat /var/spool/PBS/aux/74842.altix
altix
altix
cat /dev/cpuset/PBSPro/unrz14374842.altix/cpus
14-15
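
To make the mismatch concrete, here is a minimal sketch that cross-checks the three counts shown above. It is only a guess at where the discrepancy comes from, not mpiexec's actual code: it assumes the host count on the PBS side is obtained by splitting the exec_host string on "+", which is how the older "host/index+host/index" form is laid out. The function names and the `exec_host_classic` value are hypothetical and only there for illustration; the other values are copied from the output above.

```python
def count_exec_host_entries(exec_host: str) -> int:
    """Count '+'-separated entries in an exec_host string.

    Assumption: one entry per allocated slot, as in 'host/0+host/1'.
    """
    return len([entry for entry in exec_host.split("+") if entry])


def count_nodefile_entries(nodefile_text: str) -> int:
    """Count non-empty lines in the contents of $PBS_NODEFILE."""
    return len([line for line in nodefile_text.splitlines() if line.strip()])


def count_cpuset_cpus(cpu_list: str) -> int:
    """Count CPUs in a cpuset list such as '14-15' or '0-3,8'."""
    total = 0
    for part in cpu_list.strip().split(","):
        if not part:
            continue
        low, dash, high = part.partition("-")
        # A single CPU number has no dash; a range is inclusive.
        total += int(high) - int(low) + 1 if dash else 1
    return total


# Values copied from the job shown above.
exec_host_71 = "altix:ssinodes=1:mem=3987456kb:ncpus=2"
nodefile_text = "altix\naltix\n"
cpuset_cpus = "14-15"

# Hypothetical older-style string, for comparison only.
exec_host_classic = "altix/0+altix/1"

print("exec_host entries (7.1 string):   ", count_exec_host_entries(exec_host_71))
print("exec_host entries (classic form): ", count_exec_host_entries(exec_host_classic))
print("node file entries:                ", count_nodefile_entries(nodefile_text))
print("CPUs in cpuset:                   ", count_cpuset_cpus(cpuset_cpus))
```

Under that assumption, the exec_host string printed by qstat yields a single entry, while the node file and the cpuset both account for two slots, which would fit the "fewer hosts 1 than TM 2" wording of the error.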
Any ideas??
Kind regards,
Thomas Zeiser
--
Dipl.-Ing. Thomas ZEISER
Regionales Rechenzentrum Erlangen
Martensstr. 1, 91058 Erlangen, GERMANY