[gridengine users] Partial Solution: message: ... reports running job ... that was not supposed to be there - killing
Dave Love
d.love at liverpool.ac.uk
Thu Aug 11 20:03:29 UTC 2011
Reuti <reuti at staff.uni-marburg.de> writes:
> I think the message in the subject happens when there is something in the spool directory of the node like "$SGE_ROOT/default/spool/node01/jobs/00/0000/515" while there is nothing in "active_jobs" any longer. So it can't kill anything.
>
> Clearing the node's "jobs" directory may resolve it.
Yes, but surely execd should tidy up properly in that case. As far as I
remember, it looked like a question of ignoring a potential error and
carrying on.
More information about the users
mailing list