Undefined function or variable "idxBest" in kmeans

조회 수: 2 (최근 30일)
thi
thi 2013년 2월 25일
Hello,
I have been using kmeans function from the statistics toolbox (with matlab R2012a on both windows and macos). And it often (not always, depending on the data) crashes with the error:
Undefined function or variable "idxBest".
Error in kmeans(line 332)
idx = idxBest;
The function kmeans is called with the following option:
[indNew_nodes ctrs] = kmeans(elem,k,'emptyaction','drop');
where k = 15.
I have been googleing this error, and people have reported problems when elem has NAN values. But it is not the case here.
For instance, in the last crash, elem is a 2000*1 vector:
-100.0000
-100.0000
1.8656
1.8656
1.8656
1.8656
-100.0000
-100.0000
4.9147
-100.0000
-100.0000
4.9147
-100.0000
-100.0000
-100.0000
4.9147
4.9147
4.0652
-100.0000
4.0652
4.0652
-100.0000
-100.0000
-100.0000
-100.0000
4.0652
4.0652
-100.0000
4.0652
-100.0000
-100.0000
4.0652
4.0652
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
4.0652
-100.0000
4.0652
-100.0000
4.0652
-100.0000
4.0652
4.0652
4.0652
4.0652
-100.0000
4.0652
4.0652
-100.0000
4.0652
4.0652
4.0652
-100.0000
1.8656
-100.0000
1.8656
1.8656
1.8656
1.8656
-100.0000
1.8656
1.8656
4.9147
4.9147
4.9147
4.9147
-100.0000
4.9147
4.9147
-100.0000
-100.0000
-100.0000
4.0652
4.0652
4.0652
-100.0000
4.0652
-100.0000
4.0652
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
4.0652
-100.0000
4.0652
4.0652
-100.0000
4.0652
4.0652
-100.0000
-100.0000
4.0652
4.0652
4.0652
4.0652
-100.0000
-100.0000
-100.0000
-100.0000
4.0652
4.9147
4.9147
4.9147
-100.0000
4.9147
4.9147
4.9147
4.9147
4.9147
4.0652
4.0652
4.0652
4.0652
4.0652
4.0652
4.0652
-100.0000
4.0652
4.0652
4.0652
4.0652
-100.0000
-100.0000
4.0652
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
4.0652
-100.0000
-100.0000
4.0652
-100.0000
4.0652
4.0652
4.0652
4.0652
4.0652
-100.0000
4.0652
-100.0000
4.0652
-100.0000
-100.0000
-100.0000
4.9147
4.9147
4.9147
4.9147
4.9147
4.9147
-100.0000
4.9147
4.9147
-100.0000
1.1611
-100.0000
1.1611
-100.0000
1.1611
1.1611
-100.0000
1.1611
-100.0000
1.8656
-100.0000
-100.0000
1.8656
-100.0000
-100.0000
-100.0000
4.8754
-100.0000
-100.0000
-100.0000
-100.0000
-100.0000
1.8656
1.8656
-100.0000
1.8656
1.8656
1.8656
1.8656
-100.0000
1.8656
Can anybody help and tell me why I get these errors?

답변 (2개)

Sean de Wolski
Sean de Wolski 2013년 2월 25일
which -all kmeans
It looks like you have something shadowing the Statistics Toolbox's kmeans()
  댓글 수: 1
thi
thi 2013년 3월 1일
no, it is the kmeans of the Statistics Toolbox. I since then made a copy in my working folder of kmeans.mat to edit it, but it is the same code.

댓글을 달려면 로그인하십시오.


Tom Lane
Tom Lane 2013년 2월 25일
This looks like a bug. I can see it has been fixed in the toolbox source code for distribution in an upcoming release. From what I can tell, it can only happen if you use 'drop' as the value of 'EmptyAction'. Would it be satisfactory to use 'singleton' instead? If not, and you are willing to edit the kmeans.m file, you could change this line:
totsumD = sum(sumD);
to this:
totsumD = sum(sumD(nonempties));
  댓글 수: 2
thi
thi 2013년 3월 1일
Thank you. It seems that 'singleton' no longer makes it crash as often (but still does from time to time). In this case, it create a new cluster consisting of the one point furthest from its centroid. But what is the difference, what is the difference? It is hard for me to assess the consequences on my program...
Tom Lane
Tom Lane 2013년 3월 1일
You are right, if a cluster disappears, the function seeks to maintain the requested number of clusters by selecting the point as you describe.
If you see an error using 'singleton', especially if you've edited the file, I'd welcome an e-mail from you with instructions on how to reproduce that.

댓글을 달려면 로그인하십시오.

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by