Predictor Importance feature for Tree Ensemble (Random Forest) method
조회 수: 24 (최근 30일)
이전 댓글 표시
Hello, It seems that MATLAB package has two approaches for calculating variable importance:
The first is "predictorImportance": http://www.mathworks.com/help/stats/compactregressionensemble.predictorimportance.html
The second is permutation method: http://www.mathworks.com/help/stats/treebagger.oobpermutedvardeltaerror.html
I'm wondering what are the difference between the two approaches, and which is preferred?
Thanks!
댓글 수: 0
답변 (1개)
Prashanth Ravindran
2016년 2월 8일
This query was asked back in 2013. I will try to answer for those people who might be looking for the answer.
predictorImportance. This function has input as the ensemble created by the fitensembe function. And this function can be used to create many different kinds of ensembles such as boosting trees, bagging trees, etc..
treebagger.oobpermutedvardeltaerror: Yes this is an output from the Treebagger function in matlab which implements random forests. This can also be used to implement baggin trees by setting the 'NumPredictorsToSample' to 'all'.
You see the basic algorithms are different for the two functions and hence the outputs may be different.
댓글 수: 4
Zainab Al-RubayezayMATH
2018년 11월 4일
Hi
I got a negative result of feature importance as well when I used Treebagger. However, I got a positive result when I try to know what are the most important features of the same dataset by applying predictorImportance for the model result from ensemble.
Does anyone know the reasons?
Thanks Zainab
Shanning Bao
2019년 4월 10일
For why the feature importance may be negative:
Seems useful
참고 항목
카테고리
Help Center 및 File Exchange에서 Regression Tree Ensembles에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!