Controlling state transition using transition probability matrix in Markov chain?

조회 수: 10 (최근 30일)

chaaru datta 2022년 12월 6일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1872317-controlling-state-transition-using-transition-probability-matrix-in-markov-chain

편집: chaaru datta 2022년 12월 8일

Hello all, I am working on Markov chain and in that I would like control the state transition using transition probability matrix (TPM).

The TPM in my case is 6x6 and is given as TPM = [0.6,0.4,0,0,0,0;0.3,0.4,0.3,0,0,0;0,0.3,0.4,0.3,0,0;0,0,0.3,0.4,0.3,0;0,0,0,0.3,0.4,0.3;0,0,0,0,0.4,0.6]. For clarity, let us denote the state at time t by the row and state at time t+1 by the column of TPM.

I understood that if my current state is 2 and if transition probability is 0.4 then from TPM my next state will also be 2.

But my query is how this condition of transition probability of value 0.4 is generated ?

Any help in this regard will be highly appreciated.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

답변 (1개)

Torsten 2022년 12월 6일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1872317-controlling-state-transition-using-transition-probability-matrix-in-markov-chain#answer_1121562

편집: Torsten 2022년 12월 6일

I understood that if my current state is 2 and if transition probability is 0.4 then from TPM my next state will also be 2.

That's wrong. If you are in state 2, the probability to change in state 1 is 0.3, to remain in state 2 is 0.4 and to change in state 3 is 0.3.

But my query is how this condition of transition probability of value 0.4 is generated ?

The transition probabilities are not generated, but they are fixed and calculated in advance depending on what your Markov chain tries to model.

댓글 수: 6
이전 댓글 4개 표시이전 댓글 4개 숨기기

chaaru datta 2022년 12월 7일

Ok Thank you sir. But that again creates little confusion to me about Q-learning with Markov chain.

chaaru datta 2022년 12월 8일

편집: chaaru datta 2022년 12월 8일

My query is I am not getting about what value of next state

should we put in Bellman equation (used in q-learning) if current state

, where Bellman equation is:

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

카테고리

Computational Finance Econometrics Toolbox Regime-Switching Models Markov Chain Models

Help Center 및 File Exchange에서 Markov Chain Models에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by