Feature selection with NaN
조회 수: 3 (최근 30일)
이전 댓글 표시
Hi,
I have a high dimensional data where I've managed to build a classification model using fitctree that is returning satisfactory accuracy. The predictors contain a decent proportion of unknown values represented as NaN.
I chose fitctree because it can handle the unknowns. Now I need to reduce the number of predictors using feature selection because recording all the predictors in the final model is not practical.
Is there a feature selection function that will ignore unknown values? I have looked at fscnca and stepwiselm but both don't seem to work. Removing rows containing NaN in the predictor will ignore many other potentially useful predictors and there is no easy way to replace/estimate the unknowns.
Thank you.
댓글 수: 0
채택된 답변
Prajit T R
2018년 3월 22일
Hi Azura,
F = fillmissing(A,method) fills missing entries using the method specified by method, which can be one of the following: previous, next, nearest, linear,spline, pchip.
Cheers
댓글 수: 0
추가 답변 (0개)
참고 항목
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!