[gridengine users] Reservation and hard queue list
rayson at scalablelogic.com
Wed Feb 8 01:20:04 UTC 2012
On Tue, Feb 7, 2012 at 5:13 PM, Melot Matthieu
<Matthieu.Melot at andritz.com> wrote:
> For the record I also mention that I set 30 days as default duration and
> I use the first version of Open Grid Scheduler, which should be very
> close to Sun 6.2u5 I suppose.
I quickly checked the list of things we put into OGS 6.2u5p1 - they
are really minor bug fixes that should not affect reservation or
I will re-read (I'm working on something else at this time) your
original email and see if there is anything that we should change to
handle this case.
> So the problem was a bad interaction between RQS and reservation.
> Thanks again
> Matthieu Melot
> -----Original Message-----
> From: Reuti [mailto:reuti at staff.uni-marburg.de]
> Sent: Tuesday, February 07, 2012 4:16 PM
> To: Melot Matthieu
> Cc: users at gridengine.org
> Subject: Re: [gridengine users] Reservation and hard queue list
> Am 07.02.2012 um 21:25 schrieb Melot Matthieu:
>> I have a question on the ability to ask for a reservation that spans
> different queues. Here is the situation:
>> Let's say that I have a queue defined on one node of 2 slots, I have a
> RQS that limit the number of slots per host to 2
>> I launch the following jobs (dummy.q has one execution host of 2
>> qsub -N sleeper1 -pe mpi 1 -q dummy.q sleeper.sh 3600
>> qsub -N sleeper2 -pe mpi 1 -q dummy.q sleeper.sh 3600
>> qsub -N sleeper3 -pe mpi 1 -q dummy.q sleeper.sh 3600
> NB: What is the default runtime which is assumed for the above jobs, as
> no h_rt is given?
> $ qconf -ssconf
> entry "default_duration".
>> I have jobs sleeper1 and sleeper2 running,
>> and then I launch,
>> qsub -N reserv1 -pe mpi 2 -q dummy.q -R y -l h_rt=3600 sleeper.sh 3600
>> I have the intended behavior, namely that if I stop sleeper1, sleeper3
> won't start, and if I stop sleeper2, reserv1 can start which is what I
> want. Also in $SGE_ROOT/$SGE_CELL/common/schedule, I can see that the
> keyword RESERVING appears, and confirm that all the stuff is working.
>> Now, I delete everything and I relaunch sleeper1, 2 and 3 as before.
>> Then I launch the job reserv2 that I want to run on a different queue
> (dummy2.q) that is again defined on the same host with two slots:
>> qsub -N reserv2 -pe mpi 2 -q dummy2.q -R y -l h_rt=3600 sleeper.sh
>> In that situation I have no more effect of the reservation. If I stop
> sleeper1, sleeper3 will start immediately and prevent reserv2 to start.
> Also, in the schedule file, there are no more RESERVING keywords that
> Which version of SGE are you using? I remember a similar thread:
> -- Reuti
>> Do I make something wrong with the configuration? Is it because of the
> multiple instance queues / host, limited with an RQS that I don't have
> the reservation? If yes, what would be an alternative for the desired
>> Thank you in advance for your help
>> Matthieu Melot
>> This message and any attachments are solely for the use of the
> intended recipients. They may contain privileged and/or confidential
> information or other information protected from disclosure. If you are
> not an intended recipient, you are hereby notified that you received
> this email in error and that any review, dissemination, distribution or
> copying of this email and any attachment is strictly prohibited. If you
> have received this email in error, please contact the sender and delete
> the message and any attachment from your system.
>> Thank You.
>> users mailing list
>> users at gridengine.org
> users mailing list
> users at gridengine.org
More information about the users