for the give data 100*2 how many clusters will get. is there any relation with any parameter r any formula?

조회 수: 1 (최근 30일)
based on the obtained data how to predict clusters?

답변 (1개)

Ameer Hamza
Ameer Hamza 2018년 5월 21일
The problem of choosing the correct numbers of clusters is quite subjective and depends on your application requirement. You can refer to these links to get some idea about how to determine an optimal number of clusters from the dataset:
MATLAB clustering algorithm e.g. kmeans() allows to cluster data once you know the number of the cluster in which you want to divide the data.
  댓글 수: 2
John D'Errico
John D'Errico 2018년 5월 21일
Exactly. It can be difficult to know in advance how many clusters you need. If there were an easy way, using some magic formula, the code would use it, or at least offer such an option.
Walter Roberson
Walter Roberson 2018년 5월 21일
In the past I had occasion to test several algorithms to determine the number of clusters. None of the ones I tested produced good results. They depended on arbitrary tuning parameters whose value could not be justified, or they were sensitive to small accidents of point placement, or they would say that 27 clusters should be used instead of 4 because they had weak penalties for generating more clusters (best fit for clustering us always one cluster per unique point unless there are penalties against cluster creation.)
The ones with tunable parameters... it wasn't that they didn't work (in some sense), but rather that we were trying to produce unsupervised learning for situations where we would have no reason to expect that clusters would have a particular shape. So the only effective way to use the algorithm was to run the tunable parameters themselves through a search algorithm...

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Cluster Analysis and Anomaly Detection에 대해 자세히 알아보기

태그

제품


릴리스

R2016a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by