[gridengine users] nodes in alarm

Marty Dippel mdippel at iit.edu
Fri Sep 23 20:55:16 UTC 2011

SGE newbie question:

When I run "qstat -f", a few of the nodes show an "a" state, which I
believe means the node is in alarm.

queuename                      qtype used/tot. load_avg arch          states
all.q@compute-4-6.local        BIP   2/2       4.03     lx26-amd64    a
  35329 0.50894 finer3a    abaezgua     r     09/23/2011 11:08:04     2


1. What's the best way for me to discover the cause of the alarm state?

2. Once a node is in alarm, will it reset by itself when the condition
is corrected or will it require human intervention to clear this state?

Martin Dippel
Systems Administrator
Illinois Institute of Technology
