[gridengine users] Intermittent commlib errors with MPI jobs
d.love at liverpool.ac.uk
Sun Nov 18 23:13:17 UTC 2012
Brendan Moloney <moloney at ohsu.edu> writes:
> Ok I will test that out once I can schedule some down time. I might even be able to get my hands on another switch by then.
I doubt switch problems would only affect starting MPI jobs like that.
Something similar is in the archives from relatively recently were there
was lots of talk about switches despite the symptoms clearly being
consistent with the qrsh starter (or shepherd?) dying.
Community Grid Engine: http://arc.liv.ac.uk/SGE/
More information about the users