[gridengine users] Upgrading issues...

Jesse Becker beckerjes at mail.nih.gov
Tue Sep 27 13:32:19 UTC 2011


Well, usually the logs are in:

	$SGE_ROOT/$SGE_CELL/common/  (accounting/reporting/scheduling)
	$SGE_ROOT/$SGE_CELL/spool/qmaster/  (qmaster)

I had to make a similar path change some time ago, and it was basically
a mater of tracking down every instance of "lx24" and changing it to
"lx26".  "grep -r" is your friend, as are good backups.  Also make sure
to check all of the profile scripts in /etc/, and expect at least a few
of your users to have it hard-coded in their own environments as well.




On Tue, Sep 27, 2011 at 05:33:19AM -0400, Vic wrote:
>
>Hi All.
>
>I've got myself into a mess, and I'd appreciate any pointers you could
>give me to get out...
>
>I've got a grid based on 6.2u2 (I believe). Pretty much everything lives
>in /ope/sge/ . That filesystem is NFS-exported from one machine, and
>mounted on all the execution nodes. The binaries live in
>/opt/sge/bin/lx24-amd64/ . That works nicely enough.
>
>I'm trying to upgrade to 6.2u5 using Dave Love's RPMs. I've moved the NFS
>mount and installed the RPMs into /opt/sge on a test machine. I
>bind-mounted the /opt/sge/default and /ope/sge/Build directories back into
>that (there appear to be spool directories down there).
>
>Note that the binaries now live in /opt/sge/bin/lx26-amd64/ . The
>environment is pretty much the same as before, with just the path updated
>to cope with the change of arch.
>
>sge_execd starts. But when I try to run any commands, they hang. strace
>shows me the comms process starting with my queue master, but then it sits
>in a loop polling the master for data (which never comes). I can't even
>qconf on that machine.
>
>Can anyone tell me where to start looking for logs? I've found precious
>little (a one-liner in /tmp that didn't seem to help).
>
>Thanks!
>
>Vic.
>
>
>
>_______________________________________________
>users mailing list
>users at gridengine.org
>https://gridengine.org/mailman/listinfo/users

-- 
Jesse Becker
NHGRI Linux support (Digicon Contractor)


More information about the users mailing list