Reinforcement learning: "NextObs" vs. "LoggedState" in step function
이전 댓글 표시
Hi,
I could not find out what the difference between "NextObs" and "LoggedSignals" is in the step function. In all scripts both are passed on from the step function.
[NextObs,Reward,IsDone,LoggedSignals] = myStepFunction(Action,LoggedSignals)
"LoggedSignals" is obviously used for the next step, but what is "NextObs" used for?
Thanks!
답변 (1개)
Emmanouil Tzorakoleftherakis
2020년 7월 27일
0 개 추천
Actually, NextObs is the important thing here. It represents the value of your states after you apply current action and integrating one step.
LoggedSignals is where you can log information to view later - can be left empty too.
댓글 수: 4
Anne Tscheliessnig
2020년 7월 28일
Emmanouil Tzorakoleftherakis
2020년 7월 28일
Oh you were looking at creating custom environments with functions - I was looking at creating environments with classes by running e.g.
rlCreateEnvTemplate('myenv')
where LoggedSignals is not that important since you can use class variables to store the states.
I suspect the reason you need both LoggedSignals and NextObs is to create a unified way of using custom environments regardless of how you create it. NextObs is probably what the agent is using when interacting with the environment, whereas LoggedSignals is a way to save intermmediate values if you don't use classes to create your custom env.
lfyx
2021년 11월 1일
Hello, may I ask that, can the "sim" function output the LoggedSignals to the work space? Many information about the simulation action or observarion are saved in the LoggedSignals. However, the output of "sim" is the experince structure.
Maha Mosalam
2021년 11월 22일
Hi, what about the xact role of IsDone flag it it shuld be true or false or what?
카테고리
도움말 센터 및 File Exchange에서 Simscape Electrical에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!