How do I define a continuous reward function for RL environment?
이전 댓글 표시
I am trying to follow the double integrator example for giving a continuous reward function. When I used the custom template, and defined the reward using the QR cost function, I get an error stating that the reward should be a scalar value. Where can I find the property of reward and change it to accept vector values?
댓글 수: 3
Emmanouil Tzorakoleftherakis
2020년 10월 12일
Not sure why you want the reward to be scalar. Typically, rewards are treated as cost functions - they output a scalar value. If you have more than one states, you can turn it into a scalar using e.g. an l2 norm for example/some distance metric.
Prashanth Chivkula
2020년 10월 12일
Emmanouil Tzorakoleftherakis
2020년 10월 12일
That's right
채택된 답변
추가 답변 (0개)
카테고리
도움말 센터 및 File Exchange에서 Environments에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!