<div dir="ltr">Hi Mathieu, <div><br></div><div>Thank you very much for the excellent answer! I tried the combination of condor_suspend and condor_continue. They worked very well and did exactly what their name suggests. Condor_hold and condor_release do a little different job. </div><div><br></div><div>Thank you very much. </div><div>Sincere greetings,</div><div><br></div><div>Zhengzhi</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Apr 9, 2020 at 12:02 AM Mathieu Dupont <<a href="mailto:mdupont@cppm.in2p3.fr">mdupont@cppm.in2p3.fr</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hi, <br>
<br>
Without Condor, my first idea would be to send SIGTSTP signal to<br>
yours GATE processes runned by condor. And SIGCONT signal to resume<br>
them. <br>
<br>
And by looking at condor documentation, i found command condor_suspend<br>
(<a href="https://htcondor.readthedocs.io/en/stable/man-pages/condor_suspend.html" rel="noreferrer" target="_blank">https://htcondor.readthedocs.io/en/stable/man-pages/condor_suspend.html</a>)<br>
and condor_continue<br>
(<a href="https://htcondor.readthedocs.io/en/stable/man-pages/condor_continue.html" rel="noreferrer" target="_blank">https://htcondor.readthedocs.io/en/stable/man-pages/condor_continue.html</a>)<br>
which seem to do it. Maybe can you try them ? <br>
<br>
<br>
<br>
--------<br>
On Wed, 8 Apr 2020 15:46:52 -0700<br>
Zhengzhi Liu <<a href="mailto:zliu36@stanford.edu" target="_blank">zliu36@stanford.edu</a>> wrote:<br>
<br>
> Dear Gate users,<br>
> <br>
> For some GATE simulation, the runtime could be as long as a couple of<br>
> days even on a 56 cores cluster. However, I can't let my GATE<br>
> simulation occupy all the cores on the cluster during working hours<br>
> since other colleagues are also using the machine. Thus I tried to<br>
> hold my GATE simulation during the working hours and later resume<br>
> previous GATE simulation. The commands I found to achieve this goal<br>
> are condor_hold<br>
> <<a href="https://www.cl.cam.ac.uk/manuals/condor-V6_8_3-Manual/condor_hold.html" rel="noreferrer" target="_blank">https://www.cl.cam.ac.uk/manuals/condor-V6_8_3-Manual/condor_hold.html</a>><br>
> and condor_release<br>
> <<a href="https://www.cl.cam.ac.uk/manuals/condor-V6_8_3-Manual/condor_release.html#man-condor-release" rel="noreferrer" target="_blank">https://www.cl.cam.ac.uk/manuals/condor-V6_8_3-Manual/condor_release.html#man-condor-release</a>>.<br>
> Everything works fine that condor_hold can put my GATE jobs on hold<br>
> and condor_relese can resume GATE simulation. Except that running<br>
> condor_release would wipe existed data.<br>
> <br>
> I might have misunderstood the function of condor_hold. Honestly, I<br>
> don't fully understand the description. It might have killed the GATE<br>
> program. Are there any GATE experts who know how to pause GATE<br>
> simulation and resume it at a later time? If this is possible.<br>
> <br>
> Thank you very much for any help.<br>
> Sincere wishes,<br>
> <br>
> Zhengzhi<br>
<br>
<br>
<br>
-- <br>
Mobilisé contre la réforme des retraites et la LPPR<br>
-- <br>
Mathieu Dupont - Ingénieur de Recherche<br>
CENTRE DE PHYSIQUE DES PARTICULES DE MARSEILLE<br>
UMR 7346 - Aix-Marseille Université - CNRS/IN2P3<br>
163 avenue de Luminy, Case 902, F -13288 Marseille CEDEX 09<br>
Tél.: +33 (0) 4 91 82 72 19<br>
Site : <a href="http://cppm.in2p3.fr" rel="noreferrer" target="_blank">cppm.in2p3.fr</a> - Email : <a href="mailto:mdupont@cppm.in2p3.fr" target="_blank">mdupont@cppm.in2p3.fr</a><br>
</blockquote></div>