[gridengine users] guess required memory
reuti at staff.uni-marburg.de
Wed Jun 6 14:48:24 UTC 2012
Am 06.06.2012 um 06:27 schrieb Rayson Ho:
> On Wed, Jun 6, 2012 at 12:03 AM, Nakata Nakata <nakatahee at ymail.com> wrote:
>> I ran executable openmp program without job.I can resident memory via "top"
>> command (1.5G)
> How many threads do you usually use to run the OpenMP code??
It depends on the actual program, but usually threads are working on the same memory area. It would be interesting if the applications runs up to a certain number of threads and then starts to fail.
>> when job submitted h_vmem value should be set to high value otherwise job be
> Did Grid Engine kill it, or did the program itself crash due to some
> other limits in the system (like the linux kernel OOM killer did it)??
> Can you check the log & qacct record and see who the killer is??
> Note that Grid Engine has online job accounting - you can run qstat -j
> <jobid> and see the resource usage of the job. If Grid Engine is
> reporting more memory usage than the numbers reported by top (check
> both VIRT & RSS), then it can be shared pages that the Grid Engine's
> procfs parser currently does not understand.
> Note that with cgroups shared memory pages-related accounting error
> will be gone, as the kernel's memory accounting code is more accurate
> than the user-space interface.
> OT: Talking about cgroups, I still need to go back to Mark's email and
> re-read it...
>> From: Reuti <reuti at staff.uni-marburg.de>
>> To: Nakata Nakata <nakatahee at ymail.com>
>> Cc: "users at gridengine.org" <users at gridengine.org>
>> Sent: Tuesday, 5 June 2012, 16:40
>> Subject: Re: [gridengine users] guess required memory
>> Am 05.06.2012 um 13:28 schrieb Nakata Nakata:
>>> Good time all,
>>> I set h_vmem to physical memory 24G.
>>> an executable needs 1.5G memory.
>>> I ran the executable via job and set h_vmem. It seems h_vmem should set
>>> much higher than real e.g 20G!
>>> How can guess h_vmem value for it?
>>> what such high value for h_vmem is needed for?
>> Which version are you using and where are you oberserving the real usage of
>> 20G. I.e. you only request 1.5G, but you would expect the job to crash with
>> such a low limit? Is it a serial or parallel application?
>> -- Reuti
>> users mailing list
>> users at gridengine.org
More information about the users