[gridengine users] queues behaving differently
j.e.young at larc.nasa.gov
Tue Jul 10 20:23:46 UTC 2012
On 07/10/2012 04:14 PM, Rayson Ho wrote:
> On Tue, Jul 10, 2012 at 4:02 PM, John Young <j.e.young at larc.nasa.gov> wrote:
>>> If you really have a real use-case for setting the # of descriptors in
>>> the queue config, then let us know and we can implement that in OGS/GE
>>> (... when time permits).
>> Well... I have an engineer here who want to run a 2048 core job. When
>> he tries, he gets this error:
> I mean, a real use-case that needs the limit to be part of the queue
> config, and one that can't just use the method I described in:
> If you have a real good reason that setting S_DESCRIPTORS &
> H_DESCRIPTORS in sge_conf(5) is not enough, then we can try to improve
> things a bit... but the bottom line is that we need a good use-case.
> (Even if we had unlimited time, we still don't want to add code that
> duplicates existing code. Having said that, we can remove existing
> code and switch to the new code if needed.)
> You should be able to fix the issue by setting S_DESCRIPTORS &
> H_DESCRIPTORS, or you can check the environment that starts the execd,
> and see if it has a lower descriptor limit...
I have already modified the queue config as follows:
execd_params S_DESCRIPTORS=20480 H_DESCRIPTORS=65536 \
With this in place, it seems odd that from one of my queues I
get a default setting for the number of descriptors of 1024.
So I have two questions really:
1. Why am I getting different behavior from the two queues?
2. Since H_DESCRIPTORS=65536, why can't I request that amount and get it?
More information about the users