[gridengine users] Limit number of jobs by job name
Reuti
reuti at staff.uni-marburg.de
Mon Feb 6 22:03:13 UTC 2012
On 06.02.2012 at 22:44, Reuti wrote:
> <snip>
> As the parent jobs are not doing any work, a special parent.q would do which has to be requested by a forced boolean complex, so that only parent jobs can get in. You could even set a h_cpu limit on this queue to avoid abuse - jobs abusing this queue would get killed after 5 minutes or so. The overall slot count used in this cluster queue you can limit in an RQS.
One thing I didn't mention: you are oversubscribing the hosts intentionally. If you have further queues on each host, the RQS limiting the overall slot count on each host will need two entries:
limit name parents queues parent.q hosts {*} to slots=1
limit name slots hosts {*} to slots=$num_proc
The first matching rule ends the evaluation of that RQS, so jobs in parent.q never reach the second rule. The second RQS, which limits the overall use of parent.q, is written as usual:
limit name parents queues parent.q to slots=17
Please adjust the numbers to your needs. You could even run all parents on one machine, but I would spread it around.
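For reference, here is a rough sketch of how the rules above could look as two complete rule sets, in the format that `qconf -srqs` shows and `qconf -arqs` accepts. The rule set names `parent_per_host` and `parent_total` and the description strings are only placeholders I made up for illustration; the limit lines are the ones given above.

```
{
   name         parent_per_host
   description  "per-host slot limit, parent.q handled by the first rule"
   enabled      TRUE
   limit        name parents queues parent.q hosts {*} to slots=1
   limit        name slots hosts {*} to slots=$num_proc
}
{
   name         parent_total
   description  "cluster-wide cap on slots used in parent.q"
   enabled      TRUE
   limit        name parents queues parent.q to slots=17
}
```

Within one rule set only the first matching limit applies, while every rule set is checked separately. A parent job is therefore held to 1 slot per host by the first set and to 17 slots in total by the second, and all other jobs fall through to the `$num_proc` rule.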
-- Reuti