Running job monitor

Un article de Informaticiens département des sciences de la Terre et l'atmosphère
Version depuis le 22 de novembre 2011 à 17:33 par Michel (Discuter | changes)
(diff) ←Version avant | Version courante (diff) | Version après→ (diff)
Aller à: navigation, charcher

en construction

under construction

A running job may me monitored using a local utility called

u.job-monitor


There a 2 ways to activate this utility

  • at job submit time:
    ord_soumet ....  -prolog jobmonitor ....
  • with an explicit command in the job itself
    u.job-monitor &


caveat: in the case of an MPI job the only node that will be monitored is node 0 (primary node)


the job monitor uses 3 files found in directory $HOME/top_in_batch  for each monitored job

  • node_jobid.top
  • node_jobid.cmd
  • node_jobid.out

where node will be replaced by the host name of the primary node of the job

and jobid will be replaced by the PBS job id of said job