Measuring similarity between two individual trees in a model created by fitcensemble

I have been looking for parameters that measure the similarity between individual trees in an ensemble. I think the most suitable ones are NodeSize, NodeProbability and NodeError. However, when I look at the results, the mean values for the individual trees do not increase or decrease in any regular way, so I am not able to draw a conclusion. Could you suggest anything else for this? My code snippet for one model is below:

t = templateTree('MaxNumSplits',cell2mat(treeDepth)); % Weak-learner template tree object
C1 = fitcensemble(X_train,Y_train,'Method','RUSBoost','Learners',t);
[labels,scores] = predict(C1,X_test);
meanSuccessRate=calculateMeanSuccessRate(scores,labels,Y_test);
sonuclar(i,j)=meanSuccessRate;
. . .
nodeSize(d,z)=mean(C1.Trained{z}.NodeSize);
nodeProbability(d,z)=mean(C1.Trained{z}.NodeProbability);
nodeError(d,z)=mean(C1.Trained{z}.NodeError);

Answers (1)

Aditya Patil on 23 Dec 2020
Comparing decision trees is not straightforward, as they can have different structures, different variables at each node, and different conditions on those variables.
Generally you should evaluate the properties of the ensemble as a whole, and not that of individual trees.
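
As a rough illustration of what evaluating the ensemble as a whole could look like, the sketch below tracks the test error as trees are added, using loss with 'Mode','cumulative'. It also computes a simple pairwise agreement between trees (the fraction of test points on which two trees predict the same label). That agreement measure is only one possible stand-in for "similarity" and is not something proposed in the question or the answer. The synthetic data, the numeric 0/1 labels, the MaxNumSplits value of 5 and the 20 learning cycles are all assumptions made so the example runs on its own.

```matlab
% Synthetic two-class data: a hypothetical stand-in for the asker's
% X_train/Y_train and X_test/Y_test (numeric labels 0/1 are assumed).
rng(0);
X = randn(400, 4);
Y = double(X(:,1) + 0.5*randn(400,1) > 0.8);   % minority class is 1
idx = randperm(400);
X_train = X(idx(1:300), :);    Y_train = Y(idx(1:300));
X_test  = X(idx(301:end), :);  Y_test  = Y(idx(301:end));

% Same kind of model as in the question; 5 splits and 20 cycles are assumed values.
t  = templateTree('MaxNumSplits', 5);
C1 = fitcensemble(X_train, Y_train, 'Method', 'RUSBoost', ...
                  'Learners', t, 'NumLearningCycles', 20);

% Ensemble-level view: test classification error after 1, 2, ..., N trees.
cumErr = loss(C1, X_test, Y_test, 'Mode', 'cumulative');

% Tree-level view: labels predicted by each individual tree on the test set.
N = C1.NumTrained;
treeLabels = zeros(numel(Y_test), N);
for z = 1:N
    treeLabels(:, z) = predict(C1.Trained{z}, X_test);
end

% agreement(a,b) = fraction of test points where trees a and b predict the same label.
agreement = zeros(N);
for a = 1:N
    for b = 1:N
        agreement(a, b) = mean(treeLabels(:, a) == treeLabels(:, b));
    end
end
```

Plotting cumErr against the number of trees shows how the ensemble behaves as it grows, which is usually easier to interpret than per-tree node statistics. If the class labels are not numeric, the == comparison would need to be replaced (for example with strcmp on cell arrays of labels).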
