[gridengine users] preventing certain jobs from being suspended (subordinated)

Reuti reuti at staff.uni-marburg.de
Wed Sep 4 20:52:39 UTC 2019


On 04.09.2019 at 21:58, bergman at merctech.com wrote:

> Our SoGE (8.1.6) configuration has essentially two queues: one for "all"
> jobs and one for "short jobs". The all.q is subordinate to the short.q,
> and short jobs can suspend a job in the general queue. At the moment, the
> all.q has nodes with and without GPU resources (not ideal, not permanent,
> probably to be replaced in the future with multiple queues, but it's
> what we have now).
> 
> Our GPU jobs do not stop or free resources when suspended (OK, the CPU
> portion may respond correctly to SIGSTOP, but the GPU portion keeps
> running).
> 
> Is there any way, with our current number of queues, to exempt jobs
> using a GPU resource complex (-l gpu) from being suspended by short jobs?

Not that I'm aware of. Almost 10 years ago I had a similar idea:

https://arc.liv.ac.uk/trac/SGE/ticket/735

-- Reuti


