How to train feedforward network to solve XOR function
5 views (last 30 days)
I'm new to MATLAB, so please excuse me if this is a silly question (and excuse my English).
I'm trying to train a feedforward network to solve the XOR function:
1 hidden layer with 2 neurons; other settings are the defaults: TANSIG, backpropagation, TRAINLM, LEARNGDM, MSE.
MATLAB version R2012b.
close all, clear all, clc, format compact
p = [0 1 0 1 ; 0 0 1 1];            % XOR inputs, one pattern per column
t = [0 1 1 0];                      % XOR targets
net = feedforwardnet(2,'trainlm');  % 1 hidden layer with 2 neurons, Levenberg-Marquardt
net = train(net,p,t);
a = net(p)
I've tried this code, and also tried 'nntool' and 'nnstart'. It always seems like the training algorithm splits the 'p' set into
2 - training set,
1 - validation set,
1 - testing set.
As a result, the network trains on only part of the data (2 pairs of digits instead of 4), training stops after very few iterations (1-10) with a validation stop or "Minimum gradient reached (1.00e-010)", and simulation shows the network is untrained.
- Is my guess right (about splitting the 'p' set)?
- How can I manually give validation data (input and output sets) to the training algorithm? (my guess is sketched below)
- Should I somehow expand the 'p' and 't' sets and then use divideblock?
- Any other ideas?
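For the second question, my current guess, based on the divideFcn / divideParam properties I found in the documentation (please correct me if this is wrong):
net = feedforwardnet(2,'trainlm');
net.divideFcn = 'divideind';        % guess: choose the split columns myself
net.divideParam.trainInd = 1:4;     % train on all 4 patterns?
net.divideParam.valInd = [];        % no validation set
net.divideParam.testInd = [];       % no test set
net = train(net,p,t);
a = net(p)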
Thanks!
0 Comments
Accepted Answer
Greg Heath
16 Feb 2013
Edited: Greg Heath, 16 Feb 2013
1. Size the problem (x = input matrix p, t = target matrix):
[ I N ] = size(x) % [ 2 4 ]
[ O N ] = size(t) % [ 1 4 ]
Neq = prod(size(t)) % 4 = No. of training equations
2. For this small data set it doesn't make sense to use data division for validation stopping. So,
net.divideFcn = 'dividetrain'; % or, equivalently, = '';
3. Since the No. of estimated weights for H hidden nodes is
%Nw = (I+1)*O = 3 for H=0
%Nw = (I+1)*H+(H+1)*O for H >0
the condition Neq >= Nw yields the following upper bound for H
Hub = (Neq-O)/(I+O+1) % 3/4
which is only satisfied by H = 0 (no hidden layer). However, from a 2-dimensional plot we know that it will take at least 2 hidden nodes to separate the "0" class diagonal corners [ 0 1 ; 0 1 ] from the "1" class diagonal corners [ 1 0 ; 0 1 ] (one point per column).
Consequently, for H = 2, Nw = 9 > Neq = 4, so there will be an infinite number of solutions for the weights.
net = patternnet(2); % for classification
4. Choose MSEgoal so that the coefficient of determination (R^2, see Wikipedia) is >= 0.99. Then the model will account for at least 99% of the biased target variance:
net.trainParam.goal = 0.01*var(t',1);
5. The success of the design depends on the placement of the random initial weights. Therefore it may be necessary to make Ntrials >= 10 separate designs (use a for loop; see the sketch after point 7).
6. When training the net, use the extended output form
[ net tr y e ] = train(net,x,t);
Then, everything you need to know, besides the output y and error e, can be obtained directly from the training structure tr.
7. It is STRONGLY recommended that, somewhere along the line, you investigate the contents of tr.
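A rough sketch that puts points 2-7 together (the bookkeeping names Ntrials, bestperf, bestnet and besttr are only illustrative, not toolbox names):
[ I N ] = size(p)                       % [ 2 4 ]
[ O N ] = size(t)                       % [ 1 4 ]
net = patternnet(2);                    % H = 2, for classification
net.divideFcn = 'dividetrain';          % point 2: train on all 4 patterns
net.trainParam.goal = 0.01*var(t',1);   % point 4: MSEgoal for R^2 >= 0.99
Ntrials = 10;                           % point 5: >= 10 random weight initializations
bestperf = Inf;
for i = 1:Ntrials
    net = configure(net,p,t);           % reinitialize the weights for this trial
    [ net tr y e ] = train(net,p,t);    % point 6: extended output form
    if tr.best_perf < bestperf          % keep the best design
        bestperf = tr.best_perf;
        bestnet = net;
        besttr = tr;                    % point 7: examine this structure
    end
end
y = bestnet(p)                          % should be close to t = [ 0 1 1 0 ]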
Hope this helps.
Greg
2 Comments
Greg Heath
17 Feb 2013
Edited: Greg Heath, 17 Feb 2013
>net = feedforwardnet(2,'trainlm');
net = patternnet(2); % for classification
net = fitnet(2); % for regression
net = feedforwardnet(2); % NEVER (use one of the two above)
>net = train(net,p,t);
>a = net(p)
[ net, tr, a ] = train(net,p,t);
NMSE = tr.best_perf/var(t',1) % normalized MSE
R2 = 1 - NMSE % coefficient of determination
>But in 1 of 10 experiments the network randomly falls into some local minimum and can't get out. The number of iterations goes to 250-500 and training breaks on minimum gradient reached, untrained.
No fault on your part. This is normal. That is precisely why you have to design multiple nets.
In general, you would rank the nets by their validation error and estimate the generalization error by using the test set error of the best net chosen by the validation set.
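A rough sketch of that ranking, for a data set large enough to divide (x and t are the inputs and targets; the bookkeeping names nets, vperf, tperf and ibest are only illustrative):
Ntrials = 10;
nets = cell(1,Ntrials);
vperf = zeros(1,Ntrials);
tperf = zeros(1,Ntrials);
for i = 1:Ntrials
    net = patternnet(2);                % default dividerand split
    [ net tr ] = train(net,x,t);
    nets{i} = net;
    vperf(i) = tr.best_vperf;           % validation MSE at the stopping epoch
    tperf(i) = tr.best_tperf;           % test MSE at the same epoch
end
[ ~, ibest ] = min(vperf);              % rank the nets by validation error
bestnet = nets{ibest};
generalizationEstimate = tperf(ibest)   % test error of the chosen net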
More Answers (4)
Albert
16 Feb 2013
2 Comments
Greg Heath
17 Feb 2013
Edited: Greg Heath, 17 Feb 2013
Never use max_fail above 10.
Do you understand its function?
You don't need to change min_grad ... Something else is wrong.
Sarita Ghadge
15 Sep 2017
clc; close all; clear all;
P = [0 0 1 1; 0 1 0 1]; T = [0 1 1 0];
net = feedforwardnet(200);      % one hidden layer with 200 neurons
net.trainFcn = 'trainbr';       % Bayesian regularization
net.divideFcn = 'dividetrain';  % no validation/test split
[net, tr] = train(net,P,T)
a = net(P(:,1))
a = net(P(:,2))
a = net(P(:,3))
a = net(P(:,4))
It works for XOR using feedforwardnet with >= 150 hidden neurons.
0 Comments
ga
21 May 2024
Train the neural network for a two-input XOR gate, given the initial values:
w1 = 0.9;
w2 = 1.8;
b = -0.9;
Requirements:
Analyze the steps to train a perceptron neural network.
Program the training using MATLAB software.
Use nntool for the survey and analysis.
0 Comments