Reinforcement Learning toolbox step function

조회 수: 7 (최근 30일)
Mostafa Nazmi
Mostafa Nazmi 2020년 9월 7일
댓글: Kamalova Albina 2022년 2월 21일
Greetings everyone, I hope you're having a good time. In reinforcement learning toolbox there's a functin named "step(env, Action)", I wanted to know what is the role of the input "Action" in this function?
[Observation, Reward, IsDone, LoggedSignals] = step(env, Action)

채택된 답변

Stephan
Stephan 2020년 9월 7일
편집: Stephan 2020년 9월 7일
The action the agent has choosen in the last step, usually has an impact on the environment. To let the step function know what action was choosen the step before, you have to refer the last action to the next call of the step function, which then - based on this informations calculates the next observation, the reward and the iSDone flag.
See this example:
In the example given in the link above the action is a directed force that is applied to the system in the following step to calculate the new observations from the current step.
Building on that the step function can calculate the reward and if the IsDone value is true. Using these informations the agent gets a new information from the environment, which is the basis for the choice of the next action.
  댓글 수: 3
Maha Mosalam
Maha Mosalam 2021년 11월 22일
Hi, what about the xact role of IsDone flag it it shuld be true or false or what?
Kamalova Albina
Kamalova Albina 2022년 2월 21일
IsDone flag means the episode is finished or not. It should have a condition logic. For example, let's say you are hungry and you decide to eat something. In step function, you are continuously eating while do the actions to choose fry potato or tomato (maybe). How to know you are done and full already?! IsDone is this flag for showing you should stop this eating episode

댓글을 달려면 로그인하십시오.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Reinforcement Learning에 대해 자세히 알아보기

제품


릴리스

R2019b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by