Principal component analysis and feature reductions

Question

Diver 2015년 10월 18일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/249202-principal-component-analysis-and-feature-reductions

편집: the cyclist 2015년 10월 18일

Hi; I have a matrix composed of 35 features, I need to reduce those feature because I think many variable are dependent. I undertsood PCA could help me to do that, so using matlab, I calculated:

[coeff,score,latent] = pca(list_of_features)

I notice " coeff" contains matrix which I understood (correct me if I'm wrong) have column with high importance on the left, and second column with less importance and so on. However, it's not clear for me which column on " coeff" relate to which column on my original " list_of_features" so that I could know which variable is more important.

Thank you

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

the cyclist 2015년 10월 18일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/249202-principal-component-analysis-and-feature-reductions#answer_196356

편집: the cyclist 2015년 10월 18일

MATLAB Online에서 열기

It is true that the first column of coeff is the first principal component (PC), and is "most important" in the sense that it captures the greatest possible portion of the variation.

I think your best bet is to really dig into the documentation for pca, and the examples.

In the first example,

coeff =
   -0.0678   -0.6460    0.5673    0.5062
   -0.6785   -0.0200   -0.5440    0.4933
    0.0290    0.7553    0.4036    0.5156
    0.7309   -0.1085   -0.4684    0.4844

What that means is that the first PC is calculated as -0.0678 times your 1st variable, -0.6785 times your second variable, etc. [There are some nuances with respect to normalization and de-meaning of your data. Read the documentation!]

The second column of coeff gives PC 2, and so on.

It may be that you will get a high degree of dimension reduction, with a very small number of PC's capturing the vast majority of the variation. You can check this with the output explained, which reports the fraction of variation captured by each PC.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Principal component analysis and feature reductions

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

Principal component analysis and feature reductions

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기