Reinforcement Learning Toolbox: Not enough Room in buffer
이전 댓글 표시
Problem was a missunderstanding of an example. So this error is caused by an user error that was explained in one of the comments to this issue if that comment does not apply to you, you most likely have a different issue.
Hello,
I am pretty new in the realm of RL and am using the RL Toolbox to controll a Simulink modell with the DDPG Agent.
I have 2 actions and 2 observations
My Problem is that everytime i try to train the agent I get the Error:
An error occurred while running the simulation and the simulation was terminated
Caused by:
MATLAB System block 'rlMockLoop/RL Agent/AgentWrapper' error occurred when invoking 'outputImpl' method of 'AgentWrapper'. The error was thrown from '
'/usr/local/MATLAB/R2019a/toolbox/rl/rl/+rl/+util/ExperienceLogger.m' at line 30
'/usr/local/MATLAB/R2019a/toolbox/rl/rl/+rl/+agent/AbstractPolicy.m' at line 95
'/usr/local/MATLAB/R2019a/toolbox/rl/rl/simulink/libs/AgentWrapper.m' at line 107'.
Not enough room in the buffer to store the new experiences. Make sure the bufferSize argument is big enough.
I tried to increase the agentOption ExperienceBufferLength (even to pretty high values).
Is that even the right Option I should be looking at or am I missing something?
Code snippets:
Ts ~ 0.05
actionInfo = rlNumericSpec([2 1],...
'LowerLimit',[0 0]',...
'UpperLimit',[100 100]');
actionInfo.Name = 'StromstaerkeProzent';
actionInfo.Description = 'Aout, Ain';
%% Specify Observations
observationInfo = rlNumericSpec([2 1]);
actionInfo.Name = 'pressure';
actionInfo.Description = 'DruckWasser, Druck';
agentOpts = rlDDPGAgentOptions(...
'SampleTime',Ts,...
'TargetSmoothFactor',1e-3,...
'ExperienceBufferLength',512*((10/Ts)*1000),...
'DiscountFactor',0.99,...
'MiniBatchSize',512);
agent = rlDDPGAgent(actor,critic,agentOpts);
trainingOptions = rlTrainingOptions(...
'MaxEpisodes',1000, ...
'MaxStepsPerEpisode',10/Ts, ...
'ScoreAveragingWindowLength',5,...
'Verbose',false, ...
'Plots','training-progress',...
'StopTrainingCriteria','AverageReward',...
'StopTrainingValue',-1100,...
'SaveAgentCriteria','EpisodeReward',...
'SaveAgentValue',-1100);
simOptions = rlSimulationOptions('MaxSteps',10/Ts);
experience = sim(env,agent,simOptions);
Other:
I tried to make the buffer size relative to the episode count and the length of 10s.
I really hope somebody can help me.
댓글 수: 6
Emmanouil Tzorakoleftherakis
2019년 7월 8일
Hi Clemens,
Can you share a repro model? I may be able to help if I can reproduce the error.
Clemens Fricke
2019년 7월 8일
Anh Tran
2020년 2월 21일
Feel free to open a new question since the new issue is not related to the original.
Saurav Sthapit
2020년 9월 17일
Did anyone solve this issue?
I am facing the same isue in R2019a when simulatng with sim(env,agent,simopts) after training the agent.
eyman ikhlaq
2021년 1월 3일
i am facing the same issue please send a solution
Aysegul Kahraman
2022년 1월 30일
Hi,
You can try to use delay block after the action.
It solved my problem although that was not my first choice to solve this issue.
채택된 답변
추가 답변 (0개)
카테고리
도움말 센터 및 File Exchange에서 Reinforcement Learning에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!