Need help with Sequence input Deep Learning toolbox error

Question

Eva 2023년 6월 16일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1984309-need-help-with-sequence-input-deep-learning-toolbox-error

댓글: Ben 2023년 6월 20일

I am trying to achieve a script to execute sequence-to-label(response) classification.

I am having a problem with how the function trainNetwork wants the input data packaged.

I am trying to figure out what to do with XTrain or XTrainTest, to get the 'trainNetwork' command to accept the input data.

I am receiving the following error: 'Error using trainNetwork

Invalid training data. Predictors must be a numeric array, a datastore, or a table. For networks with sequence input, predictors can also be a cell array of sequences. '

----------------------------------------------------------------------------------------------------------------------------------------------------------------------

clear all;

close all;

clc

%--------------------------------------------------------------------------

% Generating A matrix

NewDimension = uint16(200);

mu = 5;

std = 2.1;

A = normrnd(mu, std, [NewDimension 784] );

%--------------------------------------------------------------------------

N_sample = 60000;

N_test=10000;

%XTrain = zeros(28,28,1,N_sample);

XTrain = zeros(1,NewDimension,1,N_sample);

XTrainTest = cell(N_sample,1);

YTrain=zeros(N_sample,1);

% Please dowload the MNIST data set from http://yann.lecun.com/exdb/mnist/

% and unzip.

fidimg1=fopen('train-images.idx3-ubyte','rb');

fidimg2=fopen('train-labels.idx1-ubyte','rb');

[img,count]=fread(fidimg1,16); % table head

[imgInd,count1]=fread(fidimg2,8); %table head

for k=1:N_sample

[im,~]=fread(fidimg1,[28,28]);

ind=fread(fidimg2,1);

clear testsave, clear testmult

testsave = reshape(im, 784, 1);

testmult = A*testsave;

XTrain(1,1:NewDimension,1,k)=transpose(testmult);

XTrainTest{k} = testmult';

%XTrainTest{k} = double(transpose(testmult));

%XTrain(:,:,1,k)=im';

YTrain(k)=ind;

end

fclose(fidimg1);

fclose(fidimg2);

YTrain=categorical(YTrain);

%-------------------------------------------------------------------------

XTest = zeros(1,NewDimension,N_test);

%XTest = zeros(1,784,1,N_test);

YTest=zeros(N_test,1);

fidimg1=fopen('t10k-images.idx3-ubyte','rb');

fidimg2=fopen('t10k-labels.idx1-ubyte','rb');

[img,count]=fread(fidimg1,16);

[imgInd,count1]=fread(fidimg2,8);

for k=1:N_test

[im,~]=fread(fidimg1,[28,28]);

ind=fread(fidimg2,1);

clear testsave, clear testmult

testsave = reshape(im, 784, 1);

testmult = A*testsave;

XTest(1,1:NewDimension,k)=transpose(testmult);

%XTest(:,:,1,k)=im';% training set building

YTest(k)=ind;

end

fclose(fidimg1);

fclose(fidimg2);

YTest=categorical(YTest);

numHiddenUnits = 400;

numClasses = 10;

numFeatures = NewDimension;

% Modify XTrain and XTest to be 2-D arrays

XTrain = reshape(XTrain, [NewDimension, N_sample]);

%YTrain = transpose(YTrain);

%XTest = reshape(XTest, [NewDimension, N_test]);

layers = [

featureInputLayer(numFeatures)

fullyConnectedLayer(100) % number of neurons in the first hidden layer

batchNormalizationLayer

reluLayer

fullyConnectedLayer(50) % number of neurons in the second hidden layer

batchNormalizationLayer

reluLayer

fullyConnectedLayer(numClasses)

softmaxLayer

classificationLayer];

maxEpochs = 5;

miniBatchSize = 20;

options = trainingOptions('adam', ...

'ExecutionEnvironment','cpu', ...

'MaxEpochs',maxEpochs, ...

'MiniBatchSize',miniBatchSize, ...

'GradientThreshold',1, ...

'Verbose',false, ...

'Plots','training-progress');

net = trainNetwork(XTrainTest,YTrain,layers,options);

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기

Eva 2023년 6월 16일

I think I seemed to discover the fault in this simulation, and it had to do with my layer stack up, it was defined for featureinputLayers and not for sequenceInputLayer:

The following seems to allow it to train:

layers = [

sequenceInputLayer(numFeatures)

convolution1dLayer(filterSize, numFilters, 'Padding', 'causal')

reluLayer

layerNormalizationLayer

convolution1dLayer(filterSize, 2*numFilters, 'Padding', 'causal')

reluLayer

layerNormalizationLayer

globalAveragePooling1dLayer

fullyConnectedLayer(numClasses)

softmaxLayer

classificationLayer

];

Ben 2023년 6월 20일

In general you should be able to use both featureInputLayer and sequenceInputLayer for this problem, by interpreting the image as either a vector 784 features, or a sequence of 784 features. But if you want to use convolution1dLayer you'll need to use sequenceInputLayer since convolution1dLayer operates over the sequence dimension.

For the datastores for sequence data, this example might be a good starting point using only built-in datastores, and this example includes a custom sequenceDatastore implementation that you can look at to get an idea of how you might write your own datastore for this task.

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Need help with Sequence input Deep Learning toolbox error

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기

답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

Need help with Sequence input Deep Learning toolbox error

댓글 수: 3 이전 댓글 1개 표시이전 댓글 1개 숨기기

답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기