[gridengine users] Complex configuration messes up pe_slots
Reuti
reuti at staff.uni-marburg.de
Sun Mar 13 19:38:40 UTC 2011
Hi,
Am 13.03.2011 um 17:18 schrieb Tal Hazan:
> I encountered a strange issue with sge6.2u5 :
>
> I've added new variable, 'vcsruntime_net' with Y/Y/0/0 settings and added it to global host settings with value of 8.
here you use 8 for "vcsruntime_net".
> Using the following command puts the job in 'qw' mode, scheduling information shows: cannot run in PE "serial" because it only offers 8 slots.
>
> root$ qrsh -p 0 -now no -q all.q -pe serial 10 -l vcsruntime_net=1 -display nx2:2001.0 -cwd -V -verbose -N simv simv
And here you request 10 (as it's multiplied by the slot count). Hence it can't run, as you have only 8 globally available. You can use "consumable JOB" if it shouldn't be multiplied.
Can you explain the intention of your setup - I'm puzzled that a PE is named "serial".
-- Reuti
> it looks like –pe serial 8 considers vcsruntime_net as the slots setting.
>
> qstat –f:
> queuename qtype resv/used/tot. load_avg arch states
> ---------------------------------------------------------------------------------
> all.q at rtlsrv03 BIP 0/30/40 3.07 lx24-amd64
> ---------------------------------------------------------------------------------
> emergency at rtlsrv01 BIP 0/0/1 0.02 lx24-amd64
> ---------------------------------------------------------------------------------
> emergency at rtlsrv02 BIP 0/0/1 1.00 lx24-amd64
> ---------------------------------------------------------------------------------
> emergency at rtlsrv03 BIP 0/0/1 3.07 lx24-amd64
> ---------------------------------------------------------------------------------
> all.q at rtlsrv02 BIP 0/0/40 1.00 lx24-amd64
> ---------------------------------------------------------------------------------
> all.q at rtlsrv01 BIP 0/0/40 0.02 lx24-amd64
>
> ############################################################################
> - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS
> ############################################################################
> 143719 0.40756 simv root qw 03/13/2011 16:32:33 10
>
> Qconf –sp serial:
> pe_name serial
> slots 120
> user_lists arusers deadlineusers defaultdepartment rtl verification
> xuser_lists NONE
> start_proc_args /bin/true
> stop_proc_args /bin/true
> allocation_rule $pe_slots
> control_slaves FALSE
> job_is_first_task TRUE
> urgency_slots min
> accounting_summary TRUE
>
> Thanks,
> tal.
> _______________________________________________
> users mailing list
> users at gridengine.org
> https://gridengine.org/mailman/listinfo/users
More information about the users
mailing list