Is it possible to change RL action values under certain conditions?


black_cat on 18 May 2021

Direct link to this question:

https://kr.mathworks.com/matlabcentral/answers/833083-is-it-possible-to-change-rl-action-values-under-certain-conditions

Edited: black_cat on 20 May 2021

I want my agent to output a target value, but in certain situations (when the reward drops dramatically) I want the agent to look for a better solution by changing the target value itself. I tried using an Initial Condition block so that the target value is used at the start. However, my agent (PPO) always outputs an average value after some training episodes.

Comments: 5 (3 earlier comments not shown)

black_cat on 20 May 2021

Edited: black_cat on 20 May 2021

I've tried to create a minimal version that illustrates my problem. Here, the agent outputs one of the numbers 1 to 3. I hope it's easier to understand that way.

black_cat on 20 May 2021

Edited: black_cat on 20 May 2021

Okay, even though the attached example is supposed to be easy to understand, I think I'm able to put my problem in simple terms now:

I'm training my agent to output one of 3 discrete values (1, 2, 3)
I penalize it for not outputting my target value
My target value is 1 for 50% of the time and 3 for the other 50% of the time

Once training is done (no matter which agent type I use, they all behave the same in this case), the agent outputs either 1 or 3, 100% of the time. It never changes its output; it just uses a single value. This is my problem.
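To make the behaviour described above concrete, here is a small sketch of the expected penalty per action. It is written in Python rather than MATLAB and rests on assumptions that are not in the post: that the agent cannot see which target is currently active, and that the penalty is either a flat 1 for any wrong output (`mismatch_penalty`) or the squared distance to the target (`squared_penalty`). All names are hypothetical and only serve the illustration.

```python
# Hypothetical setup: the target is 1 or 3 with equal probability, and the
# agent's observation does NOT reveal which one is active (an assumption).
ACTIONS = (1, 2, 3)
TARGETS = {1: 0.5, 3: 0.5}  # target value -> probability


def mismatch_penalty(action, target):
    """Assumed penalty: 1 for any wrong output, 0 for hitting the target."""
    return 0.0 if action == target else 1.0


def squared_penalty(action, target):
    """Alternative assumed penalty: squared distance to the target."""
    return float((action - target) ** 2)


def expected_penalty(action, penalty):
    """Average penalty of always choosing `action`, over the target distribution."""
    return sum(p * penalty(action, t) for t, p in TARGETS.items())


def blind_policy_table(penalty):
    """Expected penalty of each constant action when the target is unobserved."""
    return {a: expected_penalty(a, penalty) for a in ACTIONS}


def informed_penalty(penalty):
    """Expected penalty if the target is observed and the policy outputs it."""
    return sum(p * penalty(t, t) for t, p in TARGETS.items())


mismatch = blind_policy_table(mismatch_penalty)
squared = blind_policy_table(squared_penalty)
informed = informed_penalty(mismatch_penalty)

print("mismatch penalty, target unobserved:", mismatch)
print("squared penalty, target unobserved: ", squared)
print("mismatch penalty, target observed:  ", informed)
```

Under these assumptions, always outputting 1 or always outputting 3 ties for the lowest expected mismatch penalty (0.5), and randomly alternating between the two gives the same 0.5, so nothing pushes the agent to vary its output. With a squared penalty the middle value 2 is the best constant choice, which would look like an "average" output. Only when the active target is part of the observation can a policy reach zero penalty, so it may be worth checking whether the target signal is actually fed into the agent's observation.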

Answers (0)

Categories

AI and Statistics Deep Learning Toolbox Applications Autonomous and Control Systems Reinforcement Learning


Release

R2020b
