[gridengine users] User job fails silently
klin938 at gmail.com
Wed Aug 8 06:15:48 UTC 2018
I have a user reported his jobs stuck running for much longer than usual.
So I go to the exec host, check the process and all processes owned by that
user look like:
`- -bash /opt/gridengine/default/spool/omega-6-20/job_scripts/1187671
In qstat, it still shows job is in running state.
The user resubmitted the jobs and they ran and completed without an problem.
I am wondering what may has caused this situation in general?
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the users