Problems creating a Simulink reinforcement learning environment with time-series observations
I want to set up an online learning environment for PPO in Simulink. The state (observation) input is a 2×100 time series, and I would like to ask how to implement this. When I define the observation specification with observationInfo = rlNumericSpec([2 100]); I get the following error:
Error using rl.internal.validate.mapFunctionObservationInput (line 50)
Model input sizes must match the dimensions specified in the corresponding observation and action info specifications.
Error in rlDiscreteCategoricalActor (line 86)
model = rl.internal.validate.mapFunctionObservationInput(model,...
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Error in rl_demo (line 80)
actor = rlDiscreteCategoricalActor(actorNetwork,observationInfo,actionInfo);
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
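The error text suggests that the input layer of actorNetwork does not have the same size as the [2 100] observation specification. actorNetwork is not shown in the post, so the following is only a minimal sketch under assumptions: the action set, the hidden layer size, and the choice of imageInputLayer([2 100 1]) as the input layer are all hypothetical and would need to be adapted to the actual model.

```matlab
% Sketch only: actorNetwork is not shown in the post, so every layer
% below is an assumption made to illustrate matching sizes.
observationInfo = rlNumericSpec([2 100]);   % 2 signals x 100 time samples
actionInfo = rlFiniteSetSpec([1 2 3]);      % hypothetical discrete action set

numActions = numel(actionInfo.Elements);

% The input layer size must agree with observationInfo.Dimension.
% A [2 100] observation is treated here as a 2-by-100 single-channel array.
layers = [
    imageInputLayer([2 100 1], Normalization="none")
    fullyConnectedLayer(64)                 % hypothetical hidden size
    reluLayer
    fullyConnectedLayer(numActions)         % one output per discrete action
    ];
actorNetwork = dlnetwork(layers);

actor = rlDiscreteCategoricalActor(actorNetwork, observationInfo, actionInfo);

% Quick check: query the actor with a random observation of the declared size.
action = getAction(actor, {rand(observationInfo.Dimension)});
```

Another option along the same lines would be to flatten the signal in Simulink, declare rlNumericSpec([200 1]), and use featureInputLayer(200). Either way, the size declared in the specification and the size of the network input layer have to agree.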