[gridengine users] Partial Solution: message: ... reports running job ... that was not supposed to be there - killing

Dave Love d.love at liverpool.ac.uk
Thu Aug 11 20:03:29 UTC 2011


Reuti <reuti at staff.uni-marburg.de> writes:

> I think the message in the subject happens when there is something in the spool directory of the node like "$SGE_ROOT/default/spool/node01/jobs/00/0000/515" while there is nothing in "active_jobs" any longer. So it can't kill anything.
>
> Clearing the node's "jobs" directory may resolve it.

Yes, but surely execd should tidy up properly in that case.  As far as I
remember, it looked like a question of ignoring a potential error and
carrying on.


More information about the users mailing list