필터 지우기
필터 지우기

Important predictors in PCA analysis and Pareto

조회 수: 6 (최근 30일)
Tala Hed
Tala Hed 2017년 7월 21일
댓글: the cyclist 2017년 7월 22일
friends, I have a data set of (1400*4); 4 parameters (mass, area, distance, color) are measured for a sample size of 1400. In order to find the important predictors, I used PCA command after the data were normalized. [pcs,scrs,~,~,pexp] = pca(statsNorm); Then I plotted the Pareto chart ( pareto(pexp)) and the labels beneath each par are just (1, 2, 3 and 4). How can I figure out the correspondence between these numbers and the predictors? In other words, what is 1! mass, area, distance or color?

채택된 답변

the cyclist
the cyclist 2017년 7월 21일
The first bar corresponds to the first "principal component". It will not be one of your original variables, but rather the linear combination of your four variables that captures the largest amount of total variance.
This may sound unkind, but your question shows that you don't under what PCA is intended to do. I don't mean the MATLAB pca command, but rather the technique itself.
Rather than explain the whole technique here, and what it is useful for, I would encourage you to read a simple article about it -- maybe even just Wikipedia.
What are you trying to predict? PCA, by itself, is not really a predictive modeling technique.
  댓글 수: 2
Tala Hed
Tala Hed 2017년 7월 21일
you may be right since I am very new to machine learning but I am trying to cluster my data, no prediction yet. I get a plot after I employ the following commands: [pcs,scrs,~,~,pexp] = pca(statsNorm); grp = kmeans(statsNorm,2,'Replicates',5); gscatter(scrs(:,1),scrs(:,2),grp,'br','xo')
What I get is a plot of two clusters and I have no idea what are the axis in the plot.
the cyclist
the cyclist 2017년 7월 22일
I'm afraid that it is just not very practical to give you an entire tutorial on the meaning the principal components, and their relationship to the original data. If you are going to use this technique, you really need to learn it from some other source than me.

댓글을 달려면 로그인하십시오.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Dimensionality Reduction and Feature Extraction에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by