[gridengine users] Node refuse to run job
"Hung-Sheng Tsao (Lao Tsao 老曹) Ph.D."
laotsao at gmail.com
Thu Feb 9 17:59:40 UTC 2012
Check the CELL/spool/ directory on both the qmaster and the nodes.
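In case it helps with that check, here is a minimal Python sketch that lists everything left under a spool directory, so stale job entries stand out. It assumes the spool lives at $SGE_ROOT/$SGE_CELL/spool and that SGE_CELL falls back to "default". Both are assumptions about the installation, so adjust the path if your site spools elsewhere. The function name leftover_entries is made up for this example and is not part of Grid Engine.

```python
import os


def leftover_entries(spool_dir):
    """Return the relative path of every file and directory under spool_dir."""
    found = []
    for root, dirs, files in os.walk(spool_dir):
        for name in dirs + files:
            full_path = os.path.join(root, name)
            found.append(os.path.relpath(full_path, spool_dir))
    return sorted(found)


if __name__ == "__main__":
    # Assumption: SGE_ROOT and SGE_CELL come from the sourced SGE settings file.
    sge_root = os.environ.get("SGE_ROOT", "")
    sge_cell = os.environ.get("SGE_CELL", "default")
    spool = os.path.join(sge_root, sge_cell, "spool")
    # A missing directory simply yields no output (os.walk ignores it).
    for entry in leftover_entries(spool):
        print(entry)
```

Run it once on the qmaster and once on the affected node, then compare the two listings for entries that mention the node or an old job id.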
On 2/9/2012 12:51 PM, Jerome wrote:
> Dear all,
>
> I am running SGE version GE 6.2u2_1 on a Rocks cluster.
> For the past few days, one node has refused to run jobs. Using
> "qstat -j jid", I notice this line at the end of the output:
>
> cannot run on host "compute-2-15.local" until clean up of an previous
> run has finished
>
> I checked node 2-15, but its jobs directory is completely empty. To
> be sure, I reinstalled the node from scratch, and the problem
> persists.
> It seems to be the master that is causing this issue. Can someone help
> me find the file holding the stale information, so that I can fix it
> and let the node run jobs again?
>
> Best regards.
--
Hung-Sheng Tsao Ph D.
Founder & Principal
HopBit GridComputing LLC
cell: 9734950840
http://laotsao.blogspot.com/
http://laotsao.wordpress.com/
http://blogs.oracle.com/hstsao/