mpiexec and PBSPro's new syntax
Pete Wyckoff
pw at osc.edu
Wed May 7 11:17:24 EDT 2008
bfp at purdue.edu wrote on Tue, 06 May 2008 11:03 -0400:
> If I apply Matt Ford's patch to "get_hosts.c" in mpiexec-0.83, I'm able to
> succesfully use mpiexec with PBSPro 9, as long as I request nodes using
> the PBSPro "old style" syntax for requesting nodes, as in
>
> qsub -I -l nodes=2:ppn=2 ...
> cat $PBS_NODEFILE
> tempest-000
> tempest-000
> tempest-001
> tempest-001
>
> mpiexec -np 4 ./a.out
>
>
> However, if I use PBSPro's latest syntax for rquesting nodes, I get an
> error from mpiexec.
>
> qsub -I -l select=2:mpiprocs=2,place=scatter
> cat $PBS_NODEFILE
> tempest-000
> tempest-000
> tempest-001
> tempest-001
>
> mpiexec -np 4 ./a.out
> mpiexec: Error: get_hosts: PBS reports fewer tasks 2 than TM 4.
>
>
> Would there by any chance be another patch available for PBSPro
> version 9 ?
There's nothing newer than that partial patch. The hope was we
could figure something out that would work for all versions of PBS.
If you dig around the archives a bit, you might find a request from
me to a PBSPro user to send the ouput of qstat -f in a few different
job configurations. Mpiexec uses the Resource_List.* variables (and
could use others) to figure out what the hosts are.
A bit of explanation on what that select=... syntax means would be
useful too. Like what mpiexec should do depending on what it finds
in the qstat -f output.
If you come up with a snazzy patch, it will be easier for me to
integrate and make sure open PBS still works.
-- Pete
More information about the mpiexec
mailing list