ClassificationDiscriminant - How to exclude a predictor for only one group level?

Question

George 2013년 3월 25일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/68565-classificationdiscriminant-how-to-exclude-a-predictor-for-only-one-group-level

I am using ClassificationDiscriminant.fit to classify data. I seek to predict classes of Y, a nominal categorical variable, by X, a n-by-5 matrix of continous valued measurements.

Because some groups of Y (e.g. class level "l.b") have zero variance, I get the error:

“Error using ClassificationDiscriminant (line 624) Predictor x1 has zero variance for class l.b. Either exclude this predictor or set 'discrimType' to 'pseudoQuadratic' or 'diagQuadratic'.”

However, I cannot find user-guide, help notes or documentation that indicate how to exclude a predictor during this process. I do not want to use 'pseudoQuadratic' or the other, but would like to exclude the predictor causing the problem. Can someone please explain a simple efficient way to accomplish this?

The solution proposed to exclude an entire column of the matrix of predictors is not acceptable. This not a viable solution because this is a case where the predictor variance is zero for only one class level of the grouping variable. Removing the entire predictor variable will eliminate an important factor. What I figure is needed is: 1 ) to reassign that category of the grouping variable to another category; or 2 ) remove observations belonging to that category. Does anyone agree? What I am seeking is to use ClassificationDiscriminant.fit to proceed by excluding these zero-var cases without throwing an error, i.e. automatically exclude them. Assuming that is not possible, how do I access the variance information by category (group level) during ClassificationDiscriminant.fit, so that I can use its calculations to handle these cases prior to it throwing an error? Must I calculate the intra group level variances myself prior to submitting the data to ClassificationDiscriminant.fit?

Thank you.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Ilya 2013년 3월 25일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/68565-classificationdiscriminant-how-to-exclude-a-predictor-for-only-one-group-level#answer_79895

MATLAB Online에서 열기

Delete the corresponding column in the matrix of predictors you pass to ClassificationDiscriminant.fit:

X(:,1) = [];

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기

Ilya 2013년 3월 27일

The quadratic discriminant model needs to compute a covariance matrix for each class. This covariance matrix then needs to be inverted to compute the discriminant coefficients. If a predictor has zero variance for one class, obviously the covariance matrix for that class is singular (non-invertible).

How you want to proceed depends on the goals of your analysis and your data. You haven't clearly stated the former and I don't have access to the latter.

You certainly can compute a covariance matrix for each group. If you want to build a quadratic discriminant model, you will need to invert these covariance matrices somehow. Popular options for inverting a singular covariance matrix are: Ignore off-diagonal elements (extreme regularization, 'diagQuadratic' option), take a pseudo-inverse ('pseudoQuadratic' option), and regularize by adding small positive values to the main diagonal (not provided in ClassificationDiscriminant for the quadratic model). Since you have already dismissed the first two, you might want to regularize or maybe think of something else.

George 2013년 3월 27일

Thanks again for your comments. Would you agree that it is also possible to overcome the problem by: reassigning the problematic category of the grouping variable to another category, or removing observations belonging to that category? Also, I recognize that other inversion methods are available. However, if in my code I have chosen to use quadratic discriminant model, then to prevent error, by the time I call that function (ClassificationDiscriminant.fit) I need to have ensured that none of the class levels have zero variance. This seems to be redundant, having to perform the calculations of the discriminant function prior to calling it. I suppose that using a try/catch statement and opting to use diag- or pseudoQuadratic option in the catch realm could work.

Ilya 2013년 3월 28일

If the predictor no longer has zero variance after you merge two groups or if you remove observations for the offending group, you will be able to carry out quadratic discriminant analysis. I have no opinion on whether merging two groups or removing observations from a certain group is a sound approach for your specific analysis.

Instead of using try/catch, you can always use the 'pseudoQuadratic' option. If a covariance matrix is not singular, its inverse is equal to its pseudo inverse. In this case, however, you won't know if one of the covariance matrices is singular because the analysis type in the returned object will be always set to 'pseudoQuadratic'.

댓글을 달려면 로그인하십시오.

ClassificationDiscriminant - How to exclude a predictor for only one group level?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

Community Treasure Hunt

Classifica​tionDiscri​minant - How to exclude a predictor for only one group level?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 4 이전 댓글 2개 표시이전 댓글 2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

Community Treasure Hunt

ClassificationDiscriminant - How to exclude a predictor for only one group level?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기