How to operate on a count dataset (positive whole numbers with a lot of zeros)?

Question

aditya ramesh 2016년 6월 15일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/290156-how-to-operate-on-a-count-dataset-positive-whole-numbers-with-a-lot-of-zeros

댓글: Greg Heath 2016년 8월 1일

So i have some dataset, which is basically a count dataset. I have my own code for the classification using neural networks. Turns out that the data does not have a lot of correlation so accuracies as high as 55% is acceptable.For some reason the output of my neural network turns out to be the same prediction no matter what the input is on the testing set.I think this has something to do with the data, as the weights are not being learnt at all. The code i have written was tried on 5 different UCI classification datasets and it worked perfectly. Any kind of help is appreciated ,i would not mind sharing the code as well.

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Greg Heath 2016년 6월 15일

Do you really expect us to solve your problem with the information you have given us so far

Greg?

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Anurag Gupta 2016년 8월 1일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/290156-how-to-operate-on-a-count-dataset-positive-whole-numbers-with-a-lot-of-zeros#answer_230400

It is my understanding that your neural network is giving same output which is independent of the test data set. You would like to know the probable reason of this problem and some methods which can solve this.

This type of problem can happen when you are not choosing your activation function and cost function appropriately or your data is not normalized. I will recommend to use Sigmoid activation function and cross entropy cost function as this combination eliminates lot of problems. I will suggest to first normalize your data before separating the data in to training, cross validation and test data sets. I am assuming your data set is linearly separable. So, try using 3 layer ANN, start with using less number of hidden layer units first and increase number of hidden units gradually if the error is more. If the data is not linearly separable, try increasing the number of hidden layer.

Refer to the following documentation for more information.

http://in.mathworks.com/help/nnet/gs/classify-patterns-with-a-neural-network.html

Regards,

Anurag Gupta

MathWorks India

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Greg Heath 2016년 8월 1일

AG: If your data is linearly separable you do not need hidden layers. In fact you don't even need a neural network.

AR: If your code works on 5 UCI datasets, then, my guess is that the current data set is very unconventional. Therefore I suggest plotting and clustering. You do not mention the dimensionality.

Hope this helps.

Greg

댓글을 달려면 로그인하십시오.

How to operate on a count dataset (positive whole numbers with a lot of zeros)?

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

답변 (1개)

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

How to operate on a count dataset (positive whole numbers with a lot of zeros)?

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

답변 (1개)

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기