[gridengine users] How to take a node offline for maintenance?
d.love at liverpool.ac.uk
Sun May 8 13:09:59 UTC 2011
"Esztermann, Ansgar" <Ansgar.Esztermann at mpi-bpc.mpg.de> writes:
> On May 4, 2011, at 2:01 , William Deegan wrote:
>> What's the best way to take a node offline for maintenence?
> I'd say that depends on the circumstances.
> If you want to perform maintenance on a single node, as soon as
> possible (e.g. the node has ECC or SMART errors), use qmod -d to
> disable it, then take it down when all current jobs have finished.
One advantage of putting it into a restricted group is that you can
submit a job to tell you when it's become free, or to reboot it.
More information about the users