Reinforcement Learning Toolbox - When does algorithm train?

Question

Hans-Joachim Steinort 2019년 9월 17일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/480728-reinforcement-learning-toolbox-when-does-algorithm-train

댓글: Hans-Joachim Steinort 2019년 9월 26일

채택된 답변: Emmanouil Tzorakoleftherakis

I am currently using the RL-Toolbox with a DQN-Agent built into a long-running process-simulation.

The maximum stepcount is currently 8000 steps per episode.

Unfortunately the documentation seems a little ambiguous to me, so here my question:

Doese the train-function of the RL-Toolbox train the agent at the end of an episode or during the episode when the step count exeeds the minibatch-size (like in the baseline algorithms)?

Thank you in advance.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Emmanouil Tzorakoleftherakis 2019년 9월 25일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/480728-reinforcement-learning-toolbox-when-does-algorithm-train#answer_393529

The implementation is based on the algorithm listed here.

Weights are being updated at each time step.

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Hans-Joachim Steinort 2019년 9월 26일

"For each training time step" - that was the line I was looking for (yet looking into the source code lead me to the same conclusion).

After double-checking the baseline-algorithms I found that they do it the same way.

Thank you for your time!

댓글을 달려면 로그인하십시오.

Reinforcement Learning Toolbox - When does algorithm train?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

Reinforcement Learning Toolbox - When does algorithm train?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기