quadrotor control with Q-learning

조회 수: 1 (최근 30일)
Stevy Kuimi
Stevy Kuimi 2018년 12월 21일
Hello,
I am trying to stabilise a quadrotor using q-learning and no simulink.
I have come accross some documentations in which the whole system was subdivided into four subsystems. One for the longitudinal, lateral,yaw and altitude. Each one on their continous state space form. I was told that if I manage the stabilise one of the subsystems, I would be able to stabilise the whole.
For example I am working on the longitudinal subsystem. And the state space matrices are given below. X(k+1)=AX(k)+BU(k)
Now before I start implementing anything, I started by discretizing the system, the applying the Q-learning, but i cannot manage to reach convergence. Does anyone has an idea on how I can speed up the learning and reach convergence?
Thank you
A=[0 1 0 0;
0 0 9.8 0;
0 0 0 1;
0 0 0 0];
B=[0 ;0 ;0 ;Ix];
goal=[0 0 0 0];
if s==goal % where s is the next state
reward= 100
else
reward=-1*abs(x1)-1*abs(t1)-0.1*abs(t2)-0.1*abs(x2) % where x1=x, x2=dx, t1= theta, t2= dtheta
end

답변 (0개)

카테고리

Help CenterFile Exchange에서 Modeling에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by