reinforcement learning toolbox - q table

Question

Xinpeng Wang on 10 Jul 2019

https://kr.mathworks.com/matlabcentral/answers/471165-reinforcement-learning-toolbox-q-table

Answered: Tuong Nguyen on 7 Oct 2022

I'm new to RL and the Reinforcement Learning Toolbox. I have been experimenting with a Q-learning agent and a model in Simulink. My question is: after training, how can I access the trained Q table? The qTable I used to create the agent is still all zeros, and I cannot figure out where the trained Q-values and the policy are stored. Thank you!

Answer 1

Emmanouil Tzorakoleftherakis on 23 Jul 2019

Hi Xinpeng,

To see the trained table, all you have to do is extract the critic using getCritic. Try:

critic = getCritic(agent);

The variable 'critic' has a field that contains the Q table after training.

Answer 2

carlos pedreira on 13 Jan 2020

OK, but after that, how can I actually see the table?

Answer 3

Shikhar Sharma on 24 Jan 2020

Once the command has run, the critic variable should appear under the Workspace tab.

Answer 4

Umut Can Akdag on 18 May 2020

For those who are still looking for the Q table, I think this is the solution:

critic = getCritic(agent);

qtable = getLearnableParameters(critic);
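To actually look at the numbers, the two calls above can be followed by indexing into the returned parameters. The sketch below is not from the original answers: it builds a small untrained agent purely for illustration (4 hypothetical states, 3 actions) so that it runs on its own, and it assumes that getLearnableParameters returns a cell array whose first cell holds the Q-value matrix for a table-based critic. With your own trained agent, skip the setup lines and start at getCritic.

```matlab
% Illustrative setup only: a tiny table-based Q agent (hypothetical 4 states, 3 actions).
obsInfo = rlFiniteSetSpec(1:4);
actInfo = rlFiniteSetSpec([-2 0 2]);
qTable  = rlTable(obsInfo, actInfo);
critic0 = rlQValueRepresentation(qTable, obsInfo, actInfo);
agent   = rlQAgent(critic0, rlQAgentOptions);

% Extract the critic from the agent (after training, in the real use case).
critic = getCritic(agent);

% Assumption: for a table-based critic, the first cell holds the Q-value matrix,
% with one row per state and one column per action.
params = getLearnableParameters(critic);
Q = params{1};

disp(size(Q))   % number of states by number of actions
disp(Q)         % the Q-values themselves

% Greedy policy: for each state, the index of the action with the largest Q-value.
[~, greedyActionIdx] = max(Q, [], 2);
```

This also matches what the question observed: the qTable variable used to build the agent stays at its initial zeros, so the learned values have to be read back from the agent's critic rather than from that variable.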

Answer 5

RUBEN HERNANDEZ on 19 Apr 2022

Hi everyone,

I want to simulate a Q-learning agent (with a Q table) that controls an inverted pendulum in Simulink, just as an illustrative example.

I picked the rlSimplePendulumModel.slx model that is predefined in MATLAB.

This is my code:

mdl = 'rlSimplePendulumModel';
open_system(mdl)

obsInfo = rlNumericSpec([3 1]); % vector of 3 observations: sin(theta), cos(theta), d(theta)/dt
actInfo = rlFiniteSetSpec([-2 0 2]); % 3 possible values for torque: -2 Nm, 0 Nm and 2 Nm
obsInfo.Name = 'observations';
actInfo.Name = 'torque';

agentBlk = [mdl '/RL Agent'];
env = rlSimulinkEnv(mdl,agentBlk,obsInfo,actInfo);
env.ResetFcn = @(in)setVariable(in,'theta0',pi,'Workspace',mdl);

Ts = 0.05; % sample time
Tf = 20; % simulation time

% Fix the random generator seed for reproducibility
rng(0)

%% To create a Q-learning agent:
%% 1 Create a critic using an rlQValueRepresentation object.
qTable = rlTable(obsInfo, actInfo);
qRepresentation = rlQValueRepresentation(qTable, obsInfo, actInfo);
qRepresentation.Options.LearnRate = 0.99;

%% 2 Specify agent options using an rlQAgentOptions object.
agentOpts = rlQAgentOptions;
agentOpts.DiscountFactor = 0.99;
agentOpts.EpsilonGreedyExploration.Epsilon = 0.9;
agentOpts.EpsilonGreedyExploration.EpsilonDecay = 0.01;

%% 3 Create the agent using an rlQAgent object.
qAgent = rlQAgent(qRepresentation,agentOpts);

%% Training Algorithm
% Specify training options using an rlTrainingOptions object.
trainOpts = rlTrainingOptions;
trainOpts.MaxStepsPerEpisode = ceil(Tf/Ts);
trainOpts.MaxEpisodes = 2000;
trainOpts.StopTrainingCriteria = "AverageReward";
trainOpts.StopTrainingValue = -740;
trainOpts.ScoreAveragingWindowLength = 5;

trainingStats = train(qAgent,env,trainOpts);

And this is the error message:

Error using rlTable/validateInput (line 131)
Input must be a scalar rlFiniteSetSpec.

Error in rlTable (line 51)
validateInput(obj, ObservationInfo)

Error in qlearningpendulum (line 30)
qTable = rlTable(obsInfo, actInfo);

Any suggestions?

Answer 6

Tuong Nguyen on 7 Oct 2022

I think that to use tabular Q-learning, your observations have to be discrete and finite. That means your obsInfo has to be rlFiniteSetSpec(allStates), where "allStates" lists all the possible observations. In the code above, obsInfo is an rlNumericSpec, which describes a continuous observation, and that is why rlTable reports "Input must be a scalar rlFiniteSetSpec". See https://www.mathworks.com/help/reinforcement-learning/ref/rltable.html for rlTable and https://www.mathworks.com/help/reinforcement-learning/ref/rl.util.rlfinitesetspec.html for rlFiniteSetSpec.
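As a rough illustration of what "discrete and finite" could look like for the pendulum, the sketch below maps the continuous observation [sin(theta); cos(theta); d(theta)/dt] to a single state index on a grid and then builds the table from finite specs. The grid sizes (31 angle bins, 21 velocity bins) and the velocity range of -8 to 8 rad/s are assumptions chosen for the example, not values from the thread. Note also that the environment itself would have to emit this index as its observation (for example, by discretizing inside the Simulink model), which is not shown here.

```matlab
% Hypothetical grid: 31 angle bins over [-pi, pi], 21 velocity bins over [-8, 8] rad/s.
thetaEdges    = linspace(-pi, pi, 32);
thetaDotEdges = linspace(-8, 8, 22);
nTheta    = numel(thetaEdges) - 1;
nThetaDot = numel(thetaDotEdges) - 1;

% Example continuous observation: [sin(theta); cos(theta); d(theta)/dt].
obs = [sin(0.3); cos(0.3); -1.2];

% Recover the angle and clamp the velocity to the grid range.
theta    = atan2(obs(1), obs(2));
thetaDot = min(max(obs(3), thetaDotEdges(1)), thetaDotEdges(end));

% Bin each quantity, then combine the two bin indices into one state index.
iTheta    = discretize(theta, thetaEdges);
iThetaDot = discretize(thetaDot, thetaDotEdges);
stateIdx  = sub2ind([nTheta, nThetaDot], iTheta, iThetaDot);

% Finite specs: one element per grid cell, and the three torque values from the question.
obsInfo = rlFiniteSetSpec(1:nTheta*nThetaDot);
actInfo = rlFiniteSetSpec([-2 0 2]);
qTable  = rlTable(obsInfo, actInfo);
```

A finer grid gives a better approximation of the continuous dynamics but makes the table larger and slower to learn, so the bin counts are a trade-off to tune rather than fixed values.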
