[gridengine users] Message in stderr after exceeding resources
d.love at liverpool.ac.uk
Fri Mar 4 18:20:19 UTC 2011
Mark Dixon <m.c.dixon at leeds.ac.uk> writes:
> On Thu, 3 Mar 2011, Dave Love wrote:
>> Agreed. I keep meaning to put together a script to extract the
>> available info from qacct, the GE log files, and possibly syslog, post
>> mortem for a job (assuming shared classic spooling). Does anyone else
>> fancy having a go?
> Do you mean something like the attached script?
Very likely so, though I haven't had a chance to check it properly yet.
We should probably send the boys round, maybe with beer, to frisk Leeds
for useful stuff. Thanks again.
> It ticks 2 out of your 3
> boxes. It's hideously out of date, but worked pretty well for SGE 6.0 and
> where the client spool directories were in a central location.
There's still a 6.0 running here...
> I've been meaning to dust it off and bring it up to date for 6.2 and where
> the client spool is local to each compute node (but I arrange for the
> messages file to end up in the central location anyway),
What's your recommended recipe if one has enough pressure on the spool
to make the local spooling worthwhile (which we definitely don't)?
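
For anyone wanting a starting point on the qacct part of the post-mortem
script discussed above, here is a minimal sketch in Python. It is not Mark's
attached script. It assumes that `qacct -j <job_id>` prints one "key value"
pair per line, with records separated by lines of "=" characters. The
function names parse_qacct and qacct_job are made up for illustration, and
the GE messages files and syslog are not handled at all.

```python
import subprocess


def parse_qacct(text):
    """Split `qacct -j` style output into one dict per accounting record.

    Assumption: records are separated by lines of '=' characters and every
    other non-blank line is a key followed by whitespace and a value.
    """
    records = []
    current = {}
    for line in text.splitlines():
        if line.startswith("====="):
            # A separator line starts a new record.
            if current:
                records.append(current)
            current = {}
            continue
        parts = line.split(None, 1)
        if not parts:
            continue  # skip blank lines
        key = parts[0]
        value = parts[1].strip() if len(parts) > 1 else ""
        current[key] = value
    if current:
        records.append(current)
    return records


def qacct_job(job_id):
    """Run `qacct -j <job_id>` and return its parsed records.

    Needs qacct on PATH and a readable accounting file; raises
    subprocess.CalledProcessError if qacct exits non-zero.
    """
    result = subprocess.run(
        ["qacct", "-j", str(job_id)],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_qacct(result.stdout)
```

Each record comes back as a dict of strings, so fields such as exit_status,
failed, and maxvmem can be looked up by name where the accounting file
records them. An array job would yield one record per task.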