[gridengine users] nodes in alarm

Marty Dippel mdippel at iit.edu
Fri Sep 23 20:55:16 UTC 2011


SGE Newbie question-

When I "qstat -f" a few of the nodes return an "a" state, which I
believe means the node is in alarm.


queuename                      qtype used/tot. load_avg arch          states
----------------------------------------------------------------------------
all.q at compute-4-6.local        BIP   2/2       4.03     lx26-amd64    a
  35329 0.50894 finer3a    abaezgua     r     09/23/2011 11:08:04     2

----------------------------------------------------------------------------


1. What's the best way for me to discover the cause of the alarm state?

2. Once a node is in alarm, will it reset by itself when the condition
is corrected or will it require human intervention to clear this state?


--------------------------------
Martin Dippel
Systems Administrator
Illinois Institute of Technology


More information about the users mailing list