[gridengine users] Simplifying Parallel Environments
Mark Dixon
m.c.dixon at leeds.ac.uk
Fri Feb 3 15:16:45 UTC 2012
On Thu, 2 Feb 2012, Brian Smith wrote:
...
> In our site, it just so happens that the queues are related to the
> topology/fabric breakdown. I'm planning on implementing this feature
> using a soft request (e.g. I would like my job to run on nodes living on
> the same edge switch). PEs themselves have been a problem in our shop.
> I always get "this is complicated" or "they don't do it like this at
> teragrid", or "my last facility used PBS/LSF/etc". Yes, I'm catering to
> that crowd. They are the squeaky wheel and I've gotta give 'em the grease.
Which of course is where your JSV comes in - brilliant, aren't they? :)
For what it's worth, we have a single PE that we tell our users to submit
to, then a JSV does all the heavy lifting in figuring out what actual PE
it should use.
Of course we make all that optional, so we don't get in the way of people
who want more control.
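
To make the "single PE plus JSV" idea concrete, here is a minimal sketch of the kind of decision such a JSV might make. It is not the actual Leeds JSV: the PE names ("mpi", "mpi_single_node", "mpi_whole_nodes", "mpi_fill_up"), the node size, and the function name are all hypothetical, and the wiring into the JSV protocol itself is left out.

```python
# Hypothetical sketch only: PE names and node size are assumptions,
# not taken from any real site configuration.
CORES_PER_NODE = 8  # assumed number of cores per compute node


def choose_pe(requested_pe, slots, cores_per_node=CORES_PER_NODE):
    """Map the generic PE users submit to onto a concrete PE.

    Any PE other than the generic "mpi" is returned unchanged, so users
    who want more control are not overridden.
    """
    if requested_pe != "mpi":
        return requested_pe
    if slots <= cores_per_node:
        # Small job: keep it on one node.
        return "mpi_single_node"
    if slots % cores_per_node == 0:
        # Slot count is a whole number of nodes.
        return "mpi_whole_nodes"
    # Anything else: let the scheduler pack slots wherever they fit.
    return "mpi_fill_up"
```

A real JSV would read the job's PE name and slot range from the submission, call something like choose_pe(), and write the corrected PE back before accepting the job.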
...
> Fortunately, -l exclusive is seldom used in our shop and we usually only
> allow its use with an AR. We favor utilization of the resources so the
> higher-ups know they are being utilized.
So why do you need the nodes/ppn syntax? Do you have users who optimise
their code for specific grid patterns?
...
> Did that for a long time! Back in the 5.x days, SGE shipped with a
> template PE that had a script called 'sge_mpirun'. I still use that
> though it looks nothing like it used to. The big problem has been
> implementing PPN while supporting myriad MPI implementations. I also
> found that using a wrapper is great... until someone shows up with an
> application that can't be wrapped. Happens a lot.
...
I found that many of those problems go away if you're sneaky... like
calling your mpirun wrapper "mpirun". For really crazy things, we use the
queue starter_method :)
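
As a rough illustration of the "call your wrapper mpirun" trick, the sketch below shows the two pieces such a wrapper needs: finding the real mpirun further along PATH without finding itself again, and building the command line to hand over to it. The function names are hypothetical, and adding "-np" from NSLOTS is an assumption about what the wrapper should inject; the right flags depend on the MPI implementation in use.

```python
import os


def find_real_binary(name, search_path, wrapper_dir):
    """Return the first executable `name` on search_path outside wrapper_dir.

    Skipping wrapper_dir stops a wrapper called "mpirun" from finding
    itself. Returns None if no other copy exists.
    """
    skip = os.path.realpath(wrapper_dir)
    for directory in search_path.split(os.pathsep):
        if not directory or os.path.realpath(directory) == skip:
            continue
        candidate = os.path.join(directory, name)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


def build_command(real_mpirun, user_args, environ):
    """Build the argument list for the real mpirun.

    Assumption: if the user gave no -np and the scheduler set NSLOTS,
    the wrapper supplies "-np $NSLOTS". Other flags are passed through.
    """
    args = [real_mpirun]
    if "-np" not in user_args and "NSLOTS" in environ:
        args += ["-np", environ["NSLOTS"]]
    return args + list(user_args)
```

The wrapper script would sit in a directory placed early in PATH, call find_real_binary("mpirun", os.environ["PATH"], <its own directory>), and then exec the list returned by build_command().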
Perhaps we've been lucky, but I don't think we've got anything that cannot
be wrapped in some shape or form at the moment.
Cheers,
Mark
--
-----------------------------------------------------------------
Mark Dixon Email : m.c.dixon at leeds.ac.uk
HPC/Grid Systems Support Tel (int): 35429
Information Systems Services Tel (ext): +44(0)113 343 5429
University of Leeds, LS2 9JT, UK
-----------------------------------------------------------------