How does tree bagger handle NaN values

조회 수: 9 (최근 30일)
Jason Summers
Jason Summers 2020년 2월 7일
답변: Puru Kathuria 2020년 12월 27일
In building a random forest classifier I have some features with a large amount of NaN values, but it is not clear to me how Tree Bagger handles these NaNs. I've seen quite a bit of documentation of how that is handled in other high level programming languages, but I don't see explicitly how this is done in Matlab. Can anyone point me in the right direction so I can understand the default settings for this or user specified settings?

답변 (1개)

Puru Kathuria
Puru Kathuria 2020년 12월 27일
General rules that are followed while NaN or missing values are encountered:
  • Rule1: The algorithm simply discards the data points where all the features have NaN values and does not use them while training.
  • Rule 2: If a data point have a few NaN feature values then the algorithm will find the split on the basis of valid values first.

카테고리

Help CenterFile Exchange에서 Classification에 대해 자세히 알아보기

제품


릴리스

R2017b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by