Removing outliers from a matrix
I removed outliers from each column of my feature matrix, but now the feature vectors have different lengths. I cannot simply trim them to the same length, because I would lose data from a certain range. If I shuffle them, the correlation between my features and my outcome gets scrambled. How can I handle this? My matrix of features and outcomes is 55000-by-14; the last column holds the outcomes.
Comments: 1
Davide Masiello
11 Oct 2022
Maybe, instead of removing the outliers, you could replace them with an interpolated value.
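A minimal sketch of that suggestion, assuming a release that has filloutliers (R2017a or later) and assuming the rows are in a meaningful order, so that interpolating between neighbouring rows makes sense. The small matrix X and the injected outlier are hypothetical stand-ins for the real 55000-by-14 data.

```matlab
% Hypothetical stand-in data: 200 rows, 3 feature columns
rng(0);
X = randn(200, 3);
X(10, 2) = 25;                          % inject one obvious outlier

% Detect outliers column by column (default detection method) and
% replace each one by linear interpolation between non-outlier neighbours.
Xfilled = filloutliers(X, 'linear');

% Every column keeps its original length, so rows stay aligned.
disp(size(Xfilled))
```

Because nothing is deleted, each feature row still lines up with its outcome row. Whether interpolated values are acceptable depends on the data; if the row order carries no meaning, a fill method such as 'center' may be a better fit.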
Answers (1)
Bjorn Gustavsson
11 Oct 2022
Edited: Bjorn Gustavsson
11 Oct 2022
Quick-and-dirty answer: For a single pair of columns, the best you can do is to use only the rows in which neither value is an outlier. If you replace each outlier with NaN, the cov function can handle this for you through its 'omitrows' or 'partialrows' options. The corrcoef function has a slightly different interface: it takes a name-value pair to set the 'Rows' option; see the help and documentation for a description.
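A rough sketch of this approach, under a few assumptions: the matrix data is a hypothetical stand-in for the real 55000-by-14 matrix, isoutlier (R2017a or later) is used with its default detection rule, and only the feature columns are screened for outliers.

```matlab
% Hypothetical stand-in data: 13 feature columns plus 1 outcome column
rng(0);
data = randn(1000, 14);
data(5, 3)  = 50;                       % inject two obvious outliers
data(20, 7) = -40;

% Mark outliers as NaN instead of deleting them, so all columns keep
% the same length and each row still matches its outcome.
cleaned  = data;
features = cleaned(:, 1:end-1);
features(isoutlier(features)) = NaN;    % isoutlier works column by column
cleaned(:, 1:end-1) = features;

% 'omitrows' drops every row that contains any NaN;
% 'partialrows' drops rows pair by pair, keeping more data per pair.
C_omit = cov(cleaned, 'omitrows');
C_pair = cov(cleaned, 'partialrows');

% corrcoef uses a name-value pair for the equivalent choices.
R_comp = corrcoef(cleaned, 'Rows', 'complete');
R_pair = corrcoef(cleaned, 'Rows', 'pairwise');
```

The last row or column of R_pair then holds the correlation of each feature with the outcome, computed only from the rows where that feature was not flagged. Note that pairwise estimates use a different set of rows for each pair, so the resulting matrix is not guaranteed to be positive semidefinite.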
HTH