PCA Analysis for clustering
조회 수: 7 (최근 30일)
이전 댓글 표시
Hello,
I have a dataset with 5 columns and 7500 rows. I need to find the minimum number of principal components needed to partition the data into the best number of clusters. I used the Princomp command to calculate the eigen values of the principal components but am not able to comprehend the # of principal components needed for the partition which parameter should I use?. Please kindly answer asap. Thanks.
Arun
댓글 수: 1
Geoff
2012년 5월 1일
My understanding of principal components is that it shows you the most significant orthogonal axes within your data. To me, that means something different to clustering. My approach to clustering is to solve with k-means using several different values of k, and then devise a suitable metric (the hard part) to determine how well my data is clustered for each value of k. I then choose the clustering (and k) which best satisfies my metric. There are probably more scientific ways to go about it.
답변 (0개)
참고 항목
카테고리
Help Center 및 File Exchange에서 Statistics and Machine Learning Toolbox에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!