ExperienceBufferLength in Reinforcement Learning Toolbox

조회 수: 12 (최근 30일)

qun wang 2021년 11월 15일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1587044-experiencebufferlength-in-reinforcement-learning-toolbox

댓글: Francisco Serra 2024년 5월 2일

Hello, everyone,

I found a problem with the 'ExperienceBufferLength' property in 'rlDDPGAgentOptions' when specifying options for rl agents.

Usually this property is set as 1e6 in the examples of the Help documentation, such as here.

In this example, every episode has 600 (60/0.1) steps. Does the agent start to train when the experience buffer is filled up with the experiences (S,A,R,S'). If so, it would take at least 1667 (1000000/600 ) episodes before the agent starts to improve.

So I want to know how to determine this value.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

채택된 답변

Ari Biswas 2021년 11월 17일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1587044-experiencebufferlength-in-reinforcement-learning-toolbox#answer_833904

The agent will train until at least one minibatch can be sampled from the buffer. If your mini batch size is 64, then the first learn step will occur after the buffer has stored 64 experiences. The experience buffer is circular, i.e., it removes older experiences when full. The size of the buffer is hence important. You may lose important experiences if the buffer size is too small.