[gridengine users] Dead nodes running jobs
d.love at liverpool.ac.uk
Fri Sep 6 22:46:06 UTC 2013
François-Michel L'Heureux <fmlheureux at datacratic.com> writes:
> While investigating jobs that have been running for way too long, I've
> found out that qhost shows nodes that are dead with "alive stats" such
> as load, memuse and swapus. qstat also shows them processing jobs with
> state "r", as if the node was there and working.
Yes, it's a known bug somewhere in the issue tracker that I've never got
round to tracking down.
Community Grid Engine: http://arc.liv.ac.uk/SGE/