MATLAB Answers

Cluster Profile Manager SPMD job test Failed

조회 수: 9(최근 30일)
Soumak Bhattacharjee
Soumak Bhattacharjee 28 Nov 2019
MATLAB 9.6.0.1214997 (R2019a) Update 6
I could not use parfor so I tried to Validate my cluster in Cluster Profile Manager.
Cluster Connection test and job test passed, but SPMD job test failed with the following report: attached file.
I tried doing the following
distcomp.feature( 'LocalUseMpiexec', false )
but it was of no avail.
I also tried the following: MATLAB parloop error solution but it doesn't work either.

  댓글 수: 0

로그인 to comment.

답변(1개)

Edric Ellis
Edric Ellis 29 Nov 2019
I'm going to guess you're using Linux. This is probably related to your ulimit settings, probably the limit on number of processes. Check
$ ulimit -u
or
$ ulimit -a
to report all limits.

  댓글 수: 3

Soumak Bhattacharjee
Soumak Bhattacharjee 29 Nov 2019
Yes, I am. The output is as follows:
$ ulimit -u
1024
$ ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 2067606
max locked memory (kbytes, -l) unlimited
max memory size (kbytes, -m) unlimited
open files (-n) 4096
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) unlimited
cpu time (seconds, -t) unlimited
max user processes (-u) 1024
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
PS: The problem persists.
Stage: SPMD job test (createCommunicatingJob)
Status: Failed
Edric Ellis
Edric Ellis 29 Nov 2019
Try:
$ ulimit -u 63536
Soumak Bhattacharjee
Soumak Bhattacharjee 29 Nov 2019
Nope. This the problem still persists.
Stage: SPMD job test (createCommunicatingJob)
Status: Failed
Start Time: Mon Dec 23 07:06:46 IST 2019
Finish Time: Mon Dec 23 07:06:59 IST 2019
Running Duration: 0 min 13 sec
Description: Job errored or did not reach the state 'finished'.
Error Report: Job errored or did not reach the state 'finished'.
Command Line Output:
Debug Log: LOG FILE OUTPUT:
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[7]thread_monitor Resource temporarily unavailable in pthread_create
[26]thread_monitor Resource temporarily unavailable in pthread_create
[31]thread_monitor Resource temporarily unavailable in pthread_create

로그인 to comment.

이 질문에 답변하려면 로그인을(를) 수행하십시오.


Translated by