Distinguish between 2 variables using PCA

Question

moose 2015년 8월 9일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/232981-distinguish-between-2-variables-using-pca

편집: Sagar 2015년 8월 10일

Hello, I am trying to understand the PCA function. I have 6 recordings of Heart Rate. Four of them from person A, and two of them from person B. Does PCA can help me somehow to distinguish between the 2 persons? I mean, when I do coeff = pca(signal_matrix); ('signal_matrix' is the matrix of my 6 recordings) what exactly can I get from the coeff matrix I receive? How should I interpret it?

Thank you in advance.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Sagar 2015년 8월 9일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/232981-distinguish-between-2-variables-using-pca#answer_188760

PCA can certainly give some insights in your problem. Run PCA in your data and look at the different principal components. In you case, I guess the four variables would dominate in one principal component (presumably first) representing characteristics of A and the rest two variables would dominate another principal component (presumably second) representing characteristics of B. To make it more clear, when you look at the coefficients of the first principal component,the first four values should have higher values than the rest two. Similarly, in the second principal component, the last two coefficients should have higher values than the first four. Of course, I am presuming that A and B are distinguishable.

댓글 수: 2
없음 표시없음 숨기기

moose 2015년 8월 10일

Thank you Sagar. Can you please be more explicit (sorry, I am a bit new to this) - My coeff matrix is 6x6 (I've add a picture). What exactly should I look at? my input matrix is 3000x6, where the first 2 columns are recordings from person A, and the 4 last columns are from person B.

Sagar 2015년 8월 10일

편집: Sagar 2015년 8월 10일

In you first principal component, second element has the highest weight (0.9975) so it means that this component represents the characteristics of the second recording for A. Similarly, in the second principal component, first value has the largest weight (0.9966) so it represents characteristics of second recording of A. Similarly look at the highest values in other columns. But most importantly, look at the percentage variance explained by using a complete formula, [coeff,score,latent] = pca(___), where latent is the variance explained by the principal components. First value in latent divided by the sum of all the values in latent gives you the % variance explained by the first principal component. From those values you can know which components are important and which you can choose to drop. For further understanding, read this post: https://onlinecourses.science.psu.edu/stat505/node/54

댓글을 달려면 로그인하십시오.

Distinguish between 2 variables using PCA

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 2
없음 표시없음 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

Distinguish between 2 variables using PCA

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 2 없음 표시없음 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 2
없음 표시없음 숨기기