[gridengine users] RQS and scheduler performance (max-slots-on-all-hosts)
rayson at scalablelogic.com
Thu Apr 26 18:37:47 UTC 2012
On Thu, Apr 26, 2012 at 12:38 PM, Stuart Barkley <stuartb at 4gh.net> wrote:
> <rant>My internal logic says it should take any where near the
> original time to schedule 2000 jobs. But an awful lot of today's code
> will just consume resources without good reason. I come from a time
> when compute resources where actually very expensive and people paid
> attention to performance. Now-a-days, it seems people are willing to
> just throw memory and cpu at problems instead of careful
Note that due to the design of RQS, there is a lot of redundant work
in the scheduler, and this is not something we can fix overnight.
If the cluster mostly runs single slot jobs (serial jobs), then you
can just use the maxujobs in sched_conf(5) to limit the running jobs
> This restores my belief in the original Grid Engine coders.
> (still using sge6.2u5, CentOS 5)
> Stuart Barkley
> I've never been lost; I was once bewildered for three days, but never lost!
> -- Daniel Boone
> users mailing list
> users at gridengine.org
More information about the users