[gridengine users] Complex configuration messes up pe_slots

Reuti reuti at staff.uni-marburg.de
Sun Mar 13 19:38:40 UTC 2011


Hi,

Am 13.03.2011 um 17:18 schrieb Tal Hazan:

> I encountered a strange issue with sge6.2u5 :
> 
> I've added new variable, 'vcsruntime_net' with Y/Y/0/0 settings and added it to global host settings with value of 8.

here you use 8 for "vcsruntime_net".


> Using the following command puts the job in 'qw' mode, scheduling information shows: cannot run in PE "serial" because it only offers 8 slots.
> 
> root$ qrsh -p 0 -now no -q all.q -pe serial 10 -l vcsruntime_net=1 -display nx2:2001.0 -cwd -V -verbose -N simv simv

And here you request 10 (as it's multiplied by the slot count). Hence it can't run, as you have only 8 globally available. You can use "consumable JOB" if it shouldn't be multiplied.

Can you explain the intention of your setup - I'm puzzled that a PE is named "serial".

-- Reuti


> it looks like –pe serial 8 considers vcsruntime_net as the slots setting.
> 
> qstat –f:
> queuename                      qtype resv/used/tot. load_avg arch          states
> ---------------------------------------------------------------------------------
> all.q at rtlsrv03                 BIP   0/30/40        3.07     lx24-amd64   
> ---------------------------------------------------------------------------------
> emergency at rtlsrv01             BIP   0/0/1          0.02     lx24-amd64   
> ---------------------------------------------------------------------------------
> emergency at rtlsrv02             BIP   0/0/1          1.00     lx24-amd64   
> ---------------------------------------------------------------------------------
> emergency at rtlsrv03             BIP   0/0/1          3.07     lx24-amd64   
> ---------------------------------------------------------------------------------
> all.q at rtlsrv02                 BIP   0/0/40         1.00     lx24-amd64   
> ---------------------------------------------------------------------------------
> all.q at rtlsrv01                 BIP   0/0/40         0.02     lx24-amd64   
> 
> ############################################################################
> - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS
> ############################################################################
> 143719 0.40756 simv       root         qw    03/13/2011 16:32:33    10       
> 
> Qconf –sp serial:
> pe_name            serial
> slots              120
> user_lists         arusers deadlineusers defaultdepartment rtl verification
> xuser_lists        NONE
> start_proc_args    /bin/true
> stop_proc_args     /bin/true
> allocation_rule    $pe_slots
> control_slaves     FALSE
> job_is_first_task  TRUE
> urgency_slots      min
> accounting_summary TRUE
> 
> Thanks,
> tal.
> _______________________________________________
> users mailing list
> users at gridengine.org
> https://gridengine.org/mailman/listinfo/users




More information about the users mailing list