Checkpointing with mpiexec

Artem Polyakov artpol84 at gmail.com
Mon Jun 16 03:28:56 EDT 2008


Hello all.

I try to use mpiexec with checkpointing program, which considers all sockets
and descriptors in the program. First problem I faced is that checkpointing
entire mpiexec have following problem:
When I restart from checkpointed image restoring program searches temporary
files created by PBS and fails when did not find them. Is it possible to
divide mpiexec into 2 parts:
1. Gathering information about execution resources from PBS
2. Starting the program using predetermined temporary files (not depended on
query ID and so on).

Best regards, Polyakov Artem
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://email.osc.edu/pipermail/mpiexec/attachments/20080616/56cbcbed/attachment.htm


More information about the mpiexec mailing list