[gridengine users] How to take a node offline for maintenance?

Dave Love d.love at liverpool.ac.uk
Sun May 8 13:09:59 UTC 2011


"Esztermann, Ansgar" <Ansgar.Esztermann at mpi-bpc.mpg.de> writes:

> Hi,
>
> On May 4, 2011, at 2:01, William Deegan wrote:
>
>> What's the best way to take a node offline for maintenance?
>
> I'd say that depends on the circumstances.

Right.

> If you want to perform maintenance on a single node, as soon as
> possible (e.g. the node has ECC or SMART errors), use qmod -d to
> disable it, then take it down when all current jobs have finished.
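For reference, that sequence might look roughly like the sketch below.
The host name node01 and the DRY_RUN switch are made up for
illustration; by default the script only prints the commands it would
run. It assumes you want every queue instance on the host disabled,
hence the "*@host" pattern.

```shell
#!/bin/sh
# Sketch only: NODE and DRY_RUN are illustrative names, not from the thread.
NODE="${NODE:-node01}"
DRY_RUN="${DRY_RUN:-1}"    # 1 = just print the commands, anything else = run them

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Disable all queue instances on the host: no new jobs are scheduled
# there, and jobs that are already running carry on.
disable_node() { run qmod -d "*@$1"; }

# List running jobs (all users) in the queue instances on the host.
running_jobs() { run qstat -s r -u '*' -q "*@$1"; }

# Re-enable the queue instances once maintenance is finished.
enable_node() { run qmod -e "*@$1"; }

disable_node "$NODE"
running_jobs "$NODE"
```

Once running_jobs shows nothing left, the node can be taken down, and
enable_node puts it back into service afterwards.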

One advantage of putting the node into a restricted group instead is
that you can submit a job to tell you when it has become free, or to
reboot it.
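A rough sketch of the restricted-group idea, under several assumptions
that are not from this thread: a host group @maint and a queue maint.q
that only admins may use already exist, the site is set up so that a
job in maint.q starts only once the node is otherwise idle, and
admin@example.org stands in for a real address. As written it only
prints the commands (DRY_RUN=1).

```shell
#!/bin/sh
# Sketch only: @maint, maint.q, node01 and the mail address are hypothetical.
NODE="${NODE:-node01}"
MAINT_GROUP="${MAINT_GROUP:-@maint}"
MAINT_QUEUE="${MAINT_QUEUE:-maint.q}"
ADMIN_MAIL="${ADMIN_MAIL:-admin@example.org}"
DRY_RUN="${DRY_RUN:-1}"    # 1 = just print the commands, anything else = run them

run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Add the host to the (assumed) restricted host group.
restrict_node() { run qconf -aattr hostgroup hostlist "$1" "$MAINT_GROUP"; }

# Submit a trivial job pinned to the host; "-m b" sends mail when the
# job begins, which here serves as the "node is free" notification.
notify_when_free() {
    run qsub -q "$MAINT_QUEUE" -l hostname="$1" \
        -m b -M "$ADMIN_MAIL" -b y /bin/true
}

restrict_node "$NODE"
notify_when_free "$NODE"
```

For the reboot variant, /bin/true would be replaced by whatever command
your site uses to reboot a node with the necessary privileges.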


