[gridengine users] setting up Son of Grid Engine on CentOS 7

William Hay w.hay at ucl.ac.uk
Fri Jul 3 07:53:00 UTC 2015


On Fri, 3 Jul 2015 02:53:49 +0000
Tomas Vaisar <tvaisar at u.washington.edu> wrote:

> Hi,
> 
> I am trying to set up SoGE v 8.1.8 on a CentOS 7 box and am getting
> several errors when I start the sgemaster (listing below): first -
> NULL pointer passed as object name for "STN_name" and then several
> commlib errors about ssl
> 
> Can't find anything about the "STN_name" in SGE documentation.
> Would anybody have suggestions about the source of these errors and
> how to fix them?


> 07/02/2015 17:38:03| main|proton|I|read job database with 0 entries in 0 seconds
> 07/02/2015 17:38:03| main|proton|E|NULL pointer passed as object name for "STN_name"
> 07/02/2015 17:38:03|

A quick poke through the source suggests that STN_name is the name of a
node in the share tree (usually either a project, a user or default).

I'd try running qconf -sstree to see how the share tree is set up.

If it has somehow become corrupted try loading a clean new share tree
into grid engine with qconf -Astree.
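
To spot a node with a missing name before reloading anything, the
output of qconf -sstree can be checked mechanically. The following is
only a rough sketch: it assumes the tree is printed as plain key=value
lines (id=, name=, type=, shares=, childnodes=) with each node starting
at its id= line, and the function names and sample text are made up for
illustration.

```python
def parse_share_tree(text):
    """Split `qconf -sstree`-style text into one dict per node.

    Assumption: every node starts with an `id=` line and the rest of its
    attributes follow as `key=value` lines.
    """
    nodes = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key, value = key.strip(), value.strip()
        if key == "id":
            nodes.append({})      # a new node begins here
        if not nodes:
            continue              # ignore anything before the first id=
        nodes[-1][key] = value
    return nodes


def nodes_without_name(nodes):
    """Return the ids of nodes whose name is missing or empty."""
    return [node.get("id") for node in nodes if not node.get("name")]


# Hypothetical sample: node 1 has lost its name= line.
sample = """\
id=0
name=Root
type=0
shares=1
childnodes=1
id=1
type=0
shares=10
childnodes=NONE
"""

print(nodes_without_name(parse_share_tree(sample)))
```

Feeding it the real output (saved to a file, say) should give an empty
list for a healthy tree.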

> main|proton|I|qmaster hard descriptor limit is set to 4096
> 07/02/2015 17:38:03| main|proton|I|qmaster soft descriptor limit is set to 1024
> 07/02/2015 17:38:03| main|proton|I|qmaster will use max. 1004 file descriptors for communication
> 07/02/2015 17:38:03| main|proton|I|qmaster will accept max. 950 dynamic event clients
> 07/02/2015 17:38:03| main|proton|I|starting up SGE 8.1.8 (csp) (lx-amd64)
> 07/02/2015 17:38:09|listen|proton|E|commlib error: ssl accept error (ssl accept error for client "proton")
> 07/02/2015 17:38:09|listen|proton|E|commlib error: ssl error ([ID=218910881] in module "asn1 encoding routines": "unknown message digest algorithm")
> 07/02/2015 17:38:39|listen|proton|E|commlib error: ssl accept error (ssl accept error for client "proton")
> 07/02/2015

As for the SSL error, converting the quoted ID to hexadecimal gave me
D0C50A1, which led me, via Google, to this page:
https://www.tbs-certificates.co.uk/FAQ/en/old_openssl_sha256.html
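
For anyone wanting to repeat the conversion, here is a minimal sketch.
It also splits the code into its library, function and reason fields,
assuming the error-code packing used by OpenSSL releases before 3.0
(8 bits of library, 12 bits of function, 12 bits of reason). The
function name is my own, not anything from commlib or OpenSSL.

```python
def decode_openssl_error(decimal_id):
    """Convert a decimal OpenSSL error ID to hex and unpack its fields.

    Assumption: pre-3.0 OpenSSL layout, i.e.
    (lib << 24) | (func << 12) | reason.
    """
    code = int(decimal_id)
    return {
        "hex": format(code, "08X"),      # zero-padded, as openssl prints it
        "lib": (code >> 24) & 0xFF,      # library number
        "func": (code >> 12) & 0xFFF,    # function number
        "reason": code & 0xFFF,          # reason number
    }


info = decode_openssl_error(218910881)
print(info["hex"])                                 # 0D0C50A1
print(info["lib"], info["func"], info["reason"])   # 13 197 161
```

The zero-padded form, 0D0C50A1, is the one that usually appears in
"error:0D0C50A1:..." messages and is the more useful search term.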

It would appear you have an old version of OpenSSL on some of your
machines.  Find the host with the old version, upgrade it and possibly
recompile the grid engine components against the new version as well.
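
One way to find the odd host out is to collect the output of
"openssl version" from each machine and compare. The sketch below
assumes you have already gathered those strings somehow; the helper
names are mine, "node01" is a made-up host, and both version strings
are sample data for illustration only. Bear in mind that the openssl
command-line tool is not necessarily the library the grid engine
binaries were linked against.

```python
import re


def parse_openssl_version(banner):
    """Turn an `openssl version` banner into a sortable tuple."""
    match = re.search(r"OpenSSL\s+(\d+)\.(\d+)\.(\d+)([a-z]*)", banner)
    if match is None:
        raise ValueError("unrecognised version banner: %r" % banner)
    major, minor, patch, letters = match.groups()
    return (int(major), int(minor), int(patch), letters)


def oldest_host(banners):
    """Return the host whose banner reports the lowest OpenSSL version."""
    return min(banners, key=lambda host: parse_openssl_version(banners[host]))


# Hypothetical sample data, not taken from the original report.
sample = {
    "proton": "OpenSSL 1.0.1e-fips 11 Feb 2013",
    "node01": "OpenSSL 0.9.8e-fips-rhel5 01 Jul 2008",
}

print(oldest_host(sample))  # node01
```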

William