LSTM Python hyperparameters v MATLAB

Question

Philip Hua 2022년 5월 30일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1730370-lstm-python-hyperparameters-v-matlab

답변: Philip Hua 2022년 6월 3일

I am reading a LSTM research paper and it states:

The following experiments investigate deep RNN models parameterized by the following hyperparameters: 1. num_layers – the number of memory cell layers 2. rnn_size – the number of hidden units per memory cell (i.e. hidden state dimension) 3. wordvec – dimension of vector embeddings 4. seq_length – number of frames before truncating BPTT gradient

I can see 2 and 3 as the number of hidden units and the input size but I cannot find where one would set 1 and 4.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

David Willingham 2022년 6월 1일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1730370-lstm-python-hyperparameters-v-matlab#answer_975985

Hi Philip,

For 1, by default layers are not a "settable" parameter. You need to setup an experiment that tests networks of different sizes and see which one might give the best results. This example Try Multiple Pretrained Networks for Transfer Learning shows how you can use the Experiment Manager App in MATLAB to do this.

For 4, while I don't have an example to share on this. You could use Experiment Manager to setup an experiment that changes the sequence length of the input data used to feed the LSTM training.

댓글 수: 2
없음 표시없음 숨기기

Philip Hua 2022년 6월 1일

hi David,

Thank you for your help and suggestion. The author already tried different settings (in Python RNN) and came up with the the "optimal" set of hyper-parameters. Is there any plan for Mathworks to include these rather basic options?

David Willingham 2022년 6월 1일

For 1, in MATLAB this isn't a settable parameter, however you can set them manually:

[lstmLayer(64); lstmLayer(64)]

For 4, there is an option to set sequencelength for the mini batch in the trainingoptions:

https://www.mathworks.com/help/deeplearning/ref/trainingoptions.html

댓글을 달려면 로그인하십시오.

Answer 2

Philip Hua 2022년 6월 3일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1730370-lstm-python-hyperparameters-v-matlab#answer_977570

Thank you David. Could you however, clarify the suggested network configuration above? The number of memory cells i think is not the same as the number of lstm layers right? Perhaps you could kindly send an unrolled network diagram and label the suggested configuration ?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

LSTM Python hyperparameters v MATLAB

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (2개)

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

LSTM Python hyperparameters v MATLAB

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (2개)

댓글 수: 2 없음 표시없음 숨기기

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 2
없음 표시없음 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기