How to handle a special case of missing data in Neural Networks?

Question

Miles Brim 2021년 9월 1일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1444804-how-to-handle-a-special-case-of-missing-data-in-neural-networks

답변: David Willingham 2021년 9월 1일

Hello,

I need to train a classifier with a dataset composed of events that occur for only a subset of my records. For instance, suppose there are 100 records. Let Event 1 occur for all records, Event 2 occur for records 1:50, and Event 3 occur for records 51:100. Suppose each Event has 1 dimension of data. So I have a dataset of 3 columns with column 2 and 3 have 50 NaNs each. These records are empty because there is no data. I do not want MATALB to attempt to estimate their values. I would like to build a model that allows for these values to be empty.

I have found one option that forces the bias terms for your connections to be zero, in which case I could make all my missing records zero. However, the non-missing values sometimes have a magnitude of zero and so for these cases the bias needs to be non-zero.

Is there a way to make each input to a neural network two-dimensional, the first dimension being the training data and the second value a 0 or 1 indicating whether to overide the output to be 0 and to have the back-propogaiton ignore that branch of the node?

My actual dataset is high-dimensional with a complex collection of overlapping events. I have built a custom MAP classifier that accounts for the particular nature of the data. I want to compare its performance with a deep ANN and with some other clustering technique. Accordingly, I will have this same quesiton for something like KNN.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

David Willingham 2021년 9월 1일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1444804-how-to-handle-a-special-case-of-missing-data-in-neural-networks#answer_779009

Hi Miles,

I'd recommend adding additional 2 columns that indicate when Event 2 & 3 are active. I.e. the first column will have a 1 for when Event 2 is active, 0 when it's not. And the next column has the same for Event 3.

Regards,

David

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

How to handle a special case of missing data in Neural Networks?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

How to handle a special case of missing data in Neural Networks?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기