Matlab implementation of AlphaZero

조회 수: 6 (최근 30일)
Mohammad Gohardoust
Mohammad Gohardoust 2020년 1월 17일
댓글: John D'Errico 2020년 1월 19일
Hi,
Just out of curiosity, I'd like to know if anyone knows of a sort of ready to use AlphaZero algorithm implemented in Matlab?
  댓글 수: 2
Greg Heath
Greg Heath 2020년 1월 18일
What is the meaning of the term "AlphaZero" ???
Greg
Mohammad Gohardoust
Mohammad Gohardoust 2020년 1월 18일
Hi Greg, please see the answer by John D'Errico.

댓글을 달려면 로그인하십시오.

답변 (1개)

John D'Errico
John D'Errico 2020년 1월 18일
편집: John D'Errico 2020년 1월 18일
Now that I have some free time...
I'm pretty sure the answer is no.
Alphazero is a tool that was developed that could learn games like chess and go, merely by being given the rules and the end goal, to win. The tool would then play itself, trying different things, then learning how to become truly good at said game. It has achieved that goal to such an extent that in chess and go, (plus some others I cannot think of) it is now better than the human world champion by a considerable degree.
In fact, the human chess champion has typically had a rating on the order of 2800, sometimes flirting with 2900. The current reining chess code has usually been Stockfish, with a rating on the order of perhaps 3500 or so. The point being that a rating difference of 400 points of more means the higher individual will virtually never lose a game to the lower rated individual. So Stockfish should blow away the human chess champion, as we have seen it can do.
However, AlphaZero was able to dominate Stockfish, meaning that AlphaZero is sufficiently higher in rating than even the best A-B code.
I would note that Alphazero has NOT successfully been applied to bridge, my personal favorite mind sport. The difference in my eyes being that go and chess are games of complete information, whereas bridge is not. A large part of bridge lies in the realms of communication & even psychology. Bridge is not just pure probability. That makes it more difficult to master, not that I think it will not happen in the near future.
AlphaZero was developed by DeepMind.
As such it is proprietary to that company. However, there are now open source implementations of such a concept, for chess, in the form of Leela Chess Zero (LC0), and others. In the latest online computer chess championshoips, LC0 was in the top 2 ocmpetitors, though I don't recall the winner, and LC0 was still improving.
Again, none of them are based on MATLAB, at least that is my understanding, nor are there currently any such implementations that I have heard of. That does not mean there are none, written by some individual I have not yet heard about.
  댓글 수: 2
Mohammad Gohardoust
Mohammad Gohardoust 2020년 1월 18일
Thanks John for your answer. Added to your points, AlphaZero has been recetly employed for global optimization of quantum dynamics: here is the link to the article in nature.
John D'Errico
John D'Errico 2020년 1월 19일
Yes. I imagine the general idea will expand over time, as people find new applications for it. One step closer to Skynet perhaps...

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Image Data Workflows에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by