필터 지우기
필터 지우기

How is varimportance calculated?

조회 수: 3 (최근 30일)
Sandro
Sandro 2011년 10월 13일
Hello,
I'm using varimportance function from the classregtree package. I would like to know the theory behind this function since the Matlab help is very light.
Thanks in advance. Regards

채택된 답변

Ilya
Ilya 2011년 10월 13일
If you split parent node 1 into child nodes 2 and 3, you increase the importance of the split variable by
(R1 - R2 - R3)/Nnode
where Ri are node risks and Nnode is the total number of nodes in this tree. Risk is defined as
Ri = Pi*Ei
where Pi is the node probability and Ei is either node error (for trees grown by minimizing a non-impurity criterion such as MSE or the twoing criterion) or node impurity (for trees grown by minimizing an impurity criterion such as Gini index or deviance).
Is this what you are looking for? If yes, can you say why this was not clear to you from the documentation?

추가 답변 (2개)

Sandro
Sandro 2011년 10월 14일
Dear Ilya,
Thanks for your answer. Yes, it was exactly what I was looking for. It is much clearer with the equations (which are not in the documentation).
Regards

Gonzalo
Gonzalo 2012년 6월 13일
G. G. Moisen (2008), Classification and Regression Trees, Ecological Informatics 582-588 says :
"The importance of individual explanatory variables can be determined by measuring the proportion of variability accounted for by splits associated with each explanatory variable. Alternatively, one may address variable importance by determining the effect of excluding variables in turn and by assessing the resulting predictive accuracy of the resulting models."

카테고리

Help CenterFile Exchange에서 Statistics and Machine Learning Toolbox에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by