
Releases: masus04/Deep-Reinforcement-Learning-for-Boardgames

Unified Experiments

16 Sep 18:21


Pre-release
0.65

Preparation

Major network updates

15 Sep 13:54


Pre-release
  • Use LogSoftmax().exp() for a numerically stable, non-spiking LegalSoftmax module
  • Fix a bug in the TTT (TicTacToe) Baseline player's loss function
  • Improve plotting
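
To illustrate why computing a softmax as exp(log-softmax) is numerically stable, here is a minimal plain-Python sketch of a softmax restricted to legal moves. It is not the repository's LegalSoftmax module: the function name `legal_softmax`, the boolean `legal_mask` argument, and the choice to assign probability 0 to illegal moves are assumptions made for this example. The release itself refers to PyTorch's `LogSoftmax` followed by `.exp()`, which this sketch mirrors using only the standard library.

```python
import math


def legal_softmax(logits, legal_mask):
    """Softmax over legal moves only, computed as exp(log_softmax).

    Hypothetical helper for illustration; illegal moves get probability 0.
    """
    legal = [x for x, ok in zip(logits, legal_mask) if ok]
    if not legal:
        raise ValueError("at least one move must be legal")

    # Log-sum-exp with the maximum subtracted, so exp() never overflows
    # even when the raw logits are very large.
    m = max(legal)
    log_z = m + math.log(sum(math.exp(x - m) for x in legal))

    # log_softmax(x) = x - log_z; exponentiating it yields the probability.
    return [
        math.exp(x - log_z) if ok else 0.0
        for x, ok in zip(logits, legal_mask)
    ]


# Logits this large would raise OverflowError in a naive exp(x) / sum(exp(x)).
probs = legal_softmax([1000.0, 1001.0, 5.0], [True, True, False])
```

Because the largest legal logit is subtracted before exponentiating, every intermediate value stays in a safe range, which is the property the release note attributes to the LogSoftmax().exp() formulation.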

Major TicTacToe fix

19 Aug 08:08


Pre-release

Resolved a config file issue that affected all TicTacToe experiments. All TicTacToe experiments are run after this fix.

Major Functionality release

11 Aug 21:01


Pre-release

Includes major functionality:

  • Framework
  • TicTacToe & Othello
  • Reinforce, Baseline & Actor Critic players
  • Search player
  • GUI

-> All experiments are based on this release. A new release will be created if the learning players change.