Reinforcement learning on the board game Go
This codebase depends on the OpenAI gym environment GymGo. See the documentation for installation instructions
Actor Critic
python play.py --boardsize=9 --model=ac --temp=0.05 --mcts=81 --render=humanQ Learning
python play.py --boardsize=9 --model=val --temp=0.01 --mcts=8 --render=humanHuman rendering uses the Pyglet library to make a nice GUI for you. If you find that this doesn't work on your machine, try setting render to
terminalinstead
python3 train.py --boardsize=5See go_ai/utils.hyperparameters() to see what other hyperparameters you can modify