How to show Sample Size at Each Split in Tree using fitctree?

Justin on 29 Oct 2014
Commented: Justin on 31 Oct 2014
I am using fitctree, and of course changing the MinLeaf size alters the tree output drastically, but I am also interested in seeing how the sample size shrinks as the tree progresses.
Does anyone know how?
Thanks! Justin

Answers (1)

Siddharth Sundar on 31 Oct 2014
If I understand correctly, you want to extract the subset of observations that reaches each split in the tree.
The CutPredictor property of the ClassificationTree object is what you need.
tree.CutPredictor returns the name of the variable used to split at each node, and tree.CutPoint returns the corresponding cut values. Together they give you the condition applied at each node, which you can use to index into the training data set and recover the subset of observations on each side of the split.
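
The indexing described above can be sketched for the root node as follows. This is only an illustration, assuming the fisheriris sample data (not part of the original question), all-numeric predictors, and no missing values; for a continuous predictor, observations with x < CutPoint go to the left child and the rest go to the right.

```matlab
% Illustrative data only: Fisher's iris set, not the asker's data.
load fisheriris                      % meas (150x4 numeric), species (labels)
tree = fitctree(meas, species);

% Node 1 is the root. Map its cut variable name to a column of meas.
col = find(strcmp(tree.PredictorNames, tree.CutPredictor{1}));
cut = tree.CutPoint(1);

% Continuous predictor: x < cut goes left, x >= cut goes right.
goesLeft  = meas(:, col) <  cut;
goesRight = meas(:, col) >= cut;

% Subsets of the training data on each side of the root split.
leftObs  = meas(goesLeft, :);
rightObs = meas(goesRight, :);
nLeft  = size(leftObs, 1);
nRight = size(rightObs, 1);

fprintf('Root split on %s at %g: %d left, %d right\n', ...
    tree.CutPredictor{1}, cut, nLeft, nRight);
```

Repeating this for deeper nodes means applying the conditions of every ancestor node in turn, so it is mainly useful when the observations themselves are needed rather than just their counts.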
  Comments: 1
Justin on 31 Oct 2014
I could have been clearer: what I am looking for is the number of observations at each node in the tree.
That is, say we start with a sample of 1000 observations. I would like to know whether the first node splits that into 500 / 500 or 900 / 100, and so on for each node in the tree.
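
One possible way to get these counts directly, offered as a sketch rather than a definitive answer: the ClassificationTree object exposes NodeSize (number of training observations reaching each node), Children (left and right child indices per node), and IsBranchNode. The fisheriris data and the MinLeaf value of 5 below are assumptions chosen only for illustration.

```matlab
% Illustrative data and MinLeaf value only, not the asker's setup.
load fisheriris
tree = fitctree(meas, species, 'MinLeaf', 5);

% Indices of the nodes that actually split (branch nodes).
branchIdx = find(tree.IsBranchNode);

% Children(i,:) holds the [left right] child node numbers of node i.
kids = tree.Children(branchIdx, :);

% NodeSize(i) is the number of training observations reaching node i.
T = table(branchIdx, ...
          tree.CutPredictor(branchIdx), ...
          tree.CutPoint(branchIdx), ...
          tree.NodeSize(branchIdx), ...
          tree.NodeSize(kids(:, 1)), ...
          tree.NodeSize(kids(:, 2)), ...
          'VariableNames', {'Node', 'Predictor', 'CutPoint', 'N', 'Left', 'Right'});
disp(T)
```

Each row shows how a node's sample of N observations is divided between its left and right children. The same numbers are also shown when clicking a node in the graphical viewer opened with view(tree, 'Mode', 'graph').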
