[gridengine users] RQS and scheduler performance (max-slots-on-all-hosts)

Joshua Baker-LePain jlb at salilab.org
Thu Apr 26 17:33:43 UTC 2012


On Thu, 26 Apr 2012 at 1:24pm, Stuart Barkley wrote:

> On Thu, 26 Apr 2012 at 12:57 -0000, Reuti wrote:
>
>> What was your "schedule_interval" set to?
>>
>> Was "schedd_job_info true" set by accident?
>
> I've played with various values.  My current settings are:
>
>  % qconf -ssconf
>  schedule_interval                 0:01:00
>  schedd_job_info                   true
>  flush_submit_sec                  5
>  flush_finish_sec                  5
>  report_pjob_tickets               TRUE
>
> Previously the scheduler seemed to be taking ~5 minutes to do one pass
> and appeared to be doing scheduling passes back-to-back.  I think I
> tried setting schedule_interval to 10 minutes once and then there was
> some idle time between scheduler runs.

Well, this is disappointing.  I noticed something similar with our rather
ancient 6.1u3 install -- scheduling runs were taking a long, long time
whenever anybody submitted jobs to our PEs, and setting
"max_reservation 0" made them run much more quickly.  We do have 4 RQSs,
but I haven't tried disabling them as they are rather essential to our
setup.  In all cases, schedd_job_info was set to false.
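
For comparing installs, here is a minimal Python sketch that parses
"qconf -ssconf" style output and flags the two settings discussed above
(schedd_job_info and max_reservation).  The function names parse_sconf and
scheduling_cost_hints are hypothetical, the sample text is simply the
excerpt quoted earlier in this thread, and the idea that these two settings
are the ones worth flagging is an assumption drawn from this discussion,
not from the SGE documentation.

```python
def parse_sconf(text):
    """Parse `qconf -ssconf` style output into a dict of name -> value."""
    settings = {}
    for line in text.splitlines():
        parts = line.split(None, 1)  # name, then the rest of the line
        if len(parts) == 2:
            settings[parts[0]] = parts[1].strip()
    return settings


def scheduling_cost_hints(settings):
    """Return notes on settings this thread associates with slow passes.

    Assumption: only schedd_job_info and max_reservation are checked,
    because those are the ones mentioned in the discussion.
    """
    hints = []
    if settings.get("schedd_job_info", "").lower() == "true":
        hints.append("schedd_job_info is true")
    max_res = settings.get("max_reservation")
    # A missing key is treated as unknown, not as zero.
    if max_res is not None and max_res != "0":
        hints.append("max_reservation is nonzero (reservations enabled)")
    return hints


# Excerpt of scheduler configuration quoted earlier in the thread.
SAMPLE = """\
schedule_interval                 0:01:00
schedd_job_info                   true
flush_submit_sec                  5
flush_finish_sec                  5
report_pjob_tickets               TRUE
"""

settings = parse_sconf(SAMPLE)
hints = scheduling_cost_hints(settings)
print(hints)
```

On a live system the SAMPLE text could be replaced with the captured output
of "qconf -ssconf"; the sketch only reads the text and changes nothing.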

I'm in the midst of installing hardware for a new server on which I'll be 
running a much newer SGE version.  I was rather hoping that this would 
allow me to get reservations, RQSs, and "schedd_job_info true" (since it 
is rather handy) all working together.  Apparently not...

-- 
Joshua Baker-LePain
QB3 Shared Cluster Sysadmin
UCSF

