RL Environment: get obs from last episode

Question

Katharina Schmidt 2021년 8월 24일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1439889-rl-environment-get-obs-from-last-episode

답변: Katharina Schmidt 2021년 8월 25일

채택된 답변: Katharina Schmidt

MATLAB Online에서 열기

Hi,

I defined a Reinforcement Learning Environment based on the rlCreateEnvTemplate.

How can I limit the change of the actions, which are choosen by the agent while having a predefined action range (-50V<action<150V) ? (in my case I have voltages as actions.)

I think about something like this:

abs(action(i-1)-action(i)) < 10

for step i. But I don't know how to access the action from the previous step (which would be action(i-1)).

Another approach would be to use the change in voltage as action and then add this change to the voltage from the previous step. Again I would have to access a value from the previous step and I don't know how to get this value.

Thank you for any advice :)