What does the SequenceLength property in the training options for an LSTM layer really mean, and why is it there?

Hello,
I am trying to understand the different terminologies used for creating an lstmLayer in MATLAB. I finally understood that the numHiddenUnits parameter is the number of LSTM "cells", and that the higher it is, the "longer" the network is. So, as I understand it, with numHiddenUnits = 100 for example, the network always takes 100 time steps of the data in each training iteration. By that logic, I cannot find any use for SequenceLength.

Accepted Answer

Ieuan Evans on 27 Sep 2018
Hi,
Indeed, the software "unrolls" the layer to the length given by 'SequenceLength'. The network is stateful, so it also updates the network state between the split sequences.
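For illustration, here is a minimal sketch (the input size, output size, solver, and the sequence length of 100 below are assumed values, not taken from the question) showing that numHiddenUnits is set on the layer while SequenceLength is a training option:

numFeatures    = 3;      % assumed input size
numHiddenUnits = 100;    % size of the hidden state, not a number of time steps
numResponses   = 1;      % assumed output size

layers = [ ...
    sequenceInputLayer(numFeatures)
    lstmLayer(numHiddenUnits)          % 'OutputMode' defaults to 'sequence'
    fullyConnectedLayer(numResponses)
    regressionLayer];

options = trainingOptions('adam', ...
    'SequenceLength', 100, ...         % split long sequences into length-100 chunks
    'MiniBatchSize', 20);

% net = trainNetwork(XTrain, YTrain, layers, options);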
1 Comment
kinblu on 27 Sep 2018
Thank you :)
I think this information should be stated clearly in the documentation of the LSTM layer, since it concerns some important aspects of the whole LSTM concept that might not be intuitive for many.


More Answers (2)

Ieuan Evans on 25 Sep 2018 (edited 25 Sep 2018)
Hi,
The number of hidden units corresponds to the amount of information remembered between time steps (the hidden state). The hidden state can contain information from all previous time steps, regardless of the sequence length. If the number of hidden units is too large, then the layer might overfit to the training data.
The hidden state does not limit the number of time steps that are processed in an iteration. To split your sequences into smaller sequences for training, use the 'SequenceLength' option in trainingOptions.
If you specify the sequence length as a positive integer, then the software pads the sequences in each mini-batch to have the same length as the longest sequence, and then splits them into smaller sequences of the specified length. If splitting occurs, then the function creates extra mini-batches.
You can use this option if the full sequences do not fit in memory. Alternatively, try reducing the number of sequences per mini-batch by setting the 'MiniBatchSize' option to a lower value.
If you specify the sequence length as a positive integer, then the software processes the smaller sequences in consecutive iterations. The network also updates the network state between the split sequences.
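For example, a hedged sketch of training options that follow this advice (the specific values below are illustrative assumptions, not recommendations):

options = trainingOptions('adam', ...
    'MiniBatchSize', 8, ...            % fewer sequences held in memory per mini-batch
    'SequenceLength', 200, ...         % pad to the longest sequence, then split into length-200 chunks
    'SequencePaddingValue', 0);        % value used to pad shorter sequences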
2 Comments
kinblu on 25 Sep 2018
Hi,
Thank you for your detailed answer. There is still one detail that I don't fully get, and I hope you can quickly clear it up for me, as I am using MATLAB for my mini thesis.
So, as I understand from your answer, the number of hidden units influences the size of the shared weight matrices of the recurrent "cells". One point that still confuses me is how MATLAB determines the number of those "cells". I have the following assumptions and would like to know whether they are correct:
  • If SequenceLength is set to a fixed size, then MATLAB would choose that length as the number of cells.
  • If SequenceLength is set to 'shortest' or 'longest', then the corresponding length would be the number of cells.
Secondly, I would like to know whether MATLAB uses stateful LSTMs, i.e. does it reset the state of the LSTM after every mini-batch, or only between mini-batches that do not belong to the same, longer sequence?



MB Sylvest on 11 Mar 2019
It is not fully clear to us how LSTM is implemented in MATLAB. LSTM is well established in Keras, so for us to use MATLAB we really need some more information. I think this is best discussed with an example: https://uk.mathworks.com/help/deeplearning/examples/time-series-forecasting-using-deep-learning.html
Could you clarify:
1) What is the effective mini-batch size in the above example, given that changing the mini-batch size has no effect? My guess is that there is only one mini-batch, equal to the full (longest) sequence?
2) In the above example, is this a stateful implementation? And is it always a stateful implementation?
Kind regards, Mads
1 Comment
Ieuan Evans on 14 Mar 2019
Hi,
In the forecasting example there is only one observation (a single time series), so the mini-batch size setting has no effect. If the total number of observations is less than or equal to the mini-batch size, then the network processes all the observations in a single iteration.
When the SequenceLength option is specified as an integer, the network is stateful. The network does not reset the state between mini-batches that were created by splitting the observations; it resets the state only when a mini-batch contains a new set of observations.
When the SequenceLength option is 'shortest' or 'longest', the network is stateless. The network resets the state whenever the mini-batch contains a new set of observations.
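To make the contrast concrete, here is a small sketch (the solver and the value 50 are arbitrary assumptions):

% Integer 'SequenceLength': observations are split into chunks and the
% state carries over between chunks of the same observations (stateful).
optsStateful  = trainingOptions('adam', 'SequenceLength', 50);

% 'longest' (or 'shortest'): sequences are padded (or truncated) but never
% split, and the state is reset for every new set of observations (stateless).
optsStateless = trainingOptions('adam', 'SequenceLength', 'longest');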


Release: R2018b
