quadrotor control with Q-learning

조회 수: 1 (최근 30일)

이전 댓글 표시

Stevy Kuimi 2018년 12월 21일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/436735-quadrotor-control-with-q-learning

Quadrotor control.pdf

MATLAB Online에서 열기

Hello,

I am trying to stabilise a quadrotor using q-learning and no simulink.

I have come accross some documentations in which the whole system was subdivided into four subsystems. One for the longitudinal, lateral,yaw and altitude. Each one on their continous state space form. I was told that if I manage the stabilise one of the subsystems, I would be able to stabilise the whole.

For example I am working on the longitudinal subsystem. And the state space matrices are given below. X(k+1)=AX(k)+BU(k)

Now before I start implementing anything, I started by discretizing the system, the applying the Q-learning, but i cannot manage to reach convergence. Does anyone has an idea on how I can speed up the learning and reach convergence?

Thank you

A=[0 1  0  0;
   0 0 9.8 0;
   0 0  0  1;
   0 0  0  0];
B=[0 ;0 ;0 ;Ix];
goal=[0 0 0 0];
if s==goal % where s is the next state
   reward= 100
else
   reward=-1*abs(x1)-1*abs(t1)-0.1*abs(t2)-0.1*abs(x2) % where x1=x, x2=dx, t1= theta, t2= dtheta
end