[gridengine users] Node refuse to run job

"Hung-Sheng Tsao (Lao Tsao 老曹) Ph.D." laotsao at gmail.com
Thu Feb 9 17:59:40 UTC 2012


check the CELL/spool/ directory of the qmaster and nodes


On 2/9/2012 12:51 PM, Jerome wrote:
> Dera all
>
> I have the SGE version GE 6.2u2_1 on a Rocks cluster.
> Since few days, a node refuse to run a job. using "qstat -j jid", i 
> notice this line a the end of the output:
>
> cannot run on host "compute-2-15.local" until clean up of an previous 
> run has finished
>
> I revise on the node 2-15, but the jobs directory is totaly empty. To 
> be sure about what i do, i reinstall from scratch the node, and the 
> problem persists.
> It seems to be the master how is causing this issue. Someone can help 
> me on find where is the bad information file that i have to modify to 
> let my node running the job?
>
> Best regards.

-- 
Hung-Sheng Tsao Ph D.
Founder&  Principal
HopBit GridComputing LLC
cell: 9734950840

http://laotsao.blogspot.com/
http://laotsao.wordpress.com/
http://blogs.oracle.com/hstsao/

-------------- next part --------------
A non-text attachment was scrubbed...
Name: laotsao.vcf
Type: text/x-vcard
Size: 608 bytes
Desc: not available
URL: <http://gridengine.org/pipermail/users/attachments/20120209/40189e39/attachment.vcf>


More information about the users mailing list