[gridengine users] Unable to find pe_start file
Ursula Winkler
ursula.winkler at uni-graz.at
Tue Apr 3 09:42:24 UTC 2012
Reuti wrote:
> Hi,
>
> Am 03.04.2012 um 10:48 schrieb Ursula Winkler:
>
>
>> Dear members of gridengine,
>>
>> I have problems running mpi jobs with SGE. The error message is:
>> "unable to find pe_start file "/softw".
>>
>> Unfortunately not the whole path is printed so I can just guess.
>>
>> I created with qmon --> Parallel Environment Configuration the Environment: "mpi" -->
>> Start Proc Args: /software/sge/mpi/startmpi.sh -catch_rsh $pe_hostfile
>> Stop Proc Args: /software/sge/mpi/stopmpi.sh
>>
>
> which MPI implementation do you intend to use? With Open MPI and MPICH2 you get a Tight Integration nowadays without and special procedures.
>
>
I guess the tight integration without special procedures means I don't
need to determine "Start proc Args" and "Stop Proc Args" in the Parallel
Environment Configuration. I just did gave them the value "none" and now
the jobs are executed. Thanks.
But I have another problem: the mpi communication should go over the
infiniband-network and not over the management network (at the moment it
does the last one). To do so should I change the names in the /etc/hosts
file (the hostnames and the management interfaces have the same names,
and SGE selects hostnames not interfaces)? Or is there another possibility?
Ursula
More information about the users
mailing list