[gridengine users] slot reservation
reuti at staff.uni-marburg.de
Mon Jul 2 09:15:58 UTC 2012
Am 02.07.2012 um 08:09 schrieb Steve Schmerler:
> On Jul 01 21:10 +0200, Reuti wrote:
>> Am 01.07.2012 um 19:55 schrieb Steve Schmerler:
>>> priority. Also, there are no entries about reservation in
>>> $SGE_ROOT/$SGE_CELL/common/schedule and the "RES" column in "qstat -g
>>> c"'s output is always zero.
>> This output is for Advance Reservation (`man qstat` section "OUTPUT FORMAT"). Not the one for reserving resources.
> OK, that explains it.
>> Well, as SGE judges INIFINITY being smaller than INFINITY, always smaller jobs can slip in. As they are known to finish before additional resources are getting free (which are blocked by other jobs to infinity), they are backfilling the cluster all the time.
> Good to know. I read that in older ML posts, but now I really
> understand! Is this documented?
>> You will have to provide suitable runtimes for reservations to succeed for all jobs. As a start you can define the "default_duration" being one year or so, i.e. longer than your longest job. But it would be better to have correct estimations.
> We have several queues with run time limits (h_rt). If that value is set
No, the default runtime will be used. Although it might be better to use the tighter constraint here, once a job started in a queue with a set h_rt limit.
> (or by qsub -l h_rt=...)
> , isn't that used instead of default_duration?
> If so, I could in theory leave default_duration = INFINITY since any
> h_rt < INFINITY.
If you request it for all, then the default_duration will never be used. But in your original post you mentioned that there are no time limits set anywhere.
>>> Also, I may not fully understand what max_reservation is. Right now I
>>> set it to 128 which I assume is the max number of slots to be reserved
>> It depends on the size of the cluster. But usally around 32 should do.
> OK, but it is not the actual number of reserved slots, is it?
> users mailing list
> users at gridengine.org
More information about the users