Using time as a negative reward in RL toolbox

2022 2월 24

1 답변

업데이트 시간: 2023 11월 30

조회 수: 99 (30일)

이 질문에 답변하려면 로그인하십시오.

Follow Question

이 질문에 답변하려면 로그인하십시오.

Follow Question

이전 댓글 표시

MATLAB Online에서 열기

0 개 추천

I want to use RL toolbox to train a DQN agent. Right now, i'm using the related step_function to implement the reward function. The problem is I don't know how to punish the agent for taking too long to do the objective. How should I add time to my reward function in this toolbox? Your help is appreciated.

function [NextObs,Reward,IsDone,LoggedSignals] = WW6_StepFunction_genloss(Action,LoggedSignals)
a = Action;
obj=4;
d=[1 2];
state = LoggedSignals.State;
[next_state, ~, genloss]=attack_eff_WW6(state, a, d);
LoggedSignals.State = next_state;
NextObs = LoggedSignals.State;
Down=nnz(~next_state);
IsDone = Down==11;
Reward=genloss;
end

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

답변 (1개)

Kartik Saxena 2023년 11월 30일

0 개 추천

Hi,

I understand that you want to add time penalty in the reward function to punish it for taking too long.

The example given below in the MathWorks documentation would be useful for this purpose:

https://www.mathworks.com/help/reinforcement-learning/ug/create-matlab-environments-using-custom-functions.html

You can refer to it and introduce penalty in your reward function by deducting from the reward as per your requirements, instead of adding '1'.

I hope this resolves your issue.

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

카테고리

도움말 센터 및 File Exchange에서 Reinforcement Learning Toolbox에 대해 자세히 알아보기

제품

MATLAB

릴리스

R2021b

태그

2022년 2월 24일

2023년 11월 30일

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Translated by