Skip to content

AzerAfram/Muzero_reproduction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

The AlphaGo (paper), AlphZero (paper), and Muzero (paper) algorithms created by Deepmind were made to solve complex tasks comparable to those met in the real world. It so happens that games are an accurate representation of some of those tasks, and so, games are a great way to test the learning capabilities as well as the generality of these models. The Muzero algorithm is the compilement of its predecessors (AlphaGo and AlphaZero) which are inferior to it in both accuracy and robusticity. The template was used here to reproduce the Muzero algorithm usng the multiprocessing module, which I recommend be swapped according to one's needs (TPU).

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages