Generalization in the ANN

Question

0 개 추천

A network was created with 2000 records and aslo 5 networks were created with 400 records for each of them.(Note that the input data with 2000 records was divided to build 5 individual networks)Now the performance of each subnetworks is better than larg networks with 2000 records.Can we conclude that the larg network learnt too much from the examples given during the training, thus loosing the capability to generalize on the basis of new examples (overfitting)? but the small networks performed better because they had the less training records? Thanks a lot for any advice.

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

Greg Heath 2016년 10월 22일

0 개 추천

> Can we conclude that the larger network learnt too much from the examples given during the training, thus loosing the capability to generalize on the basis of new examples (overfitting)? but the small networks performed better because they had the less training records?

ABSOLUTELY NOT!

Results heavily depend on how the data is divided. For example: randomly vs by sections.

You apparently misunderstand the concepts of overfitting and overtraining:

OVERFITTING: There are more unknown weights than training equations. This allows an infinite number of minima for training data ( How many solutions {x1,x2} are there for the problem x1+x2 = 1 ?!) which are not minima for nontraining (i.e., validation and testing) data.

OVERTRAINING: Training an overfit network beyond the point where performance on NONTRAINING data begins to deteriorate.

As long as all data is representative of the general I/O mapping, the more data, the better. That is why random datadivision is the default in MATLAB NN training programs.

Hope this helps.

Thank you for formally accepting my answer

Greg

댓글 수: 3
이전 댓글 1개 표시 이전 댓글 1개 숨기기

Greg Heath 2016년 10월 26일

No: IF the data are representative of the general I/O mapping.

This is easy to check:

Compare the performance of each net on all of the data.

Rita 2016년 10월 26일

편집: Rita 2016년 10월 26일

Unfortunately, the performance of nets(5nets) with all of the data was not good.I also tried to examine the performance of each small nets with the data of each year and the results were not good too.so it seems that something wrong with data or nets?? Thanks again Greg

댓글을 달려면 로그인하십시오.

Generalization in the ANN

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 3
이전 댓글 1개 표시 이전 댓글 1개 숨기기

카테고리

태그

Community Treasure Hunt

Generalization in the ANN

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 3 이전 댓글 1개 표시 이전 댓글 1개 숨기기

카테고리

태그

참고 항목

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글 수: 3
이전 댓글 1개 표시 이전 댓글 1개 숨기기