Releases · masus04/Deep-Reinforcement-Learning-for-Boardgames

16 Sep 18:21

masus04

a9c96b0

Unified Experiments Pre-release

Pre-release

0.65

Preparation

Assets 2

15 Sep 13:54

masus04

0.6

b5026ea

Major network updates Pre-release

Pre-release

Make use of LogSoftmax().exp() for numerically stable and non spiking LegalSoftmax module
Fixes TTT Baseline Player's loss function bug
Improved plotting

Assets 2

19 Aug 08:08

masus04

0.55

5c83c8b

Major TicTacToe fix Pre-release

Pre-release

Resolved config file issue which impacts all TicTacToe experiments. All TicTacToe experiments are run after this point.

Assets 2

11 Aug 21:01

masus04

0.5

6126e5f

Major Functionality release Pre-release

Pre-release

Includes major functionality:

Framework
TicTacToe & Othello
Reinforce, Baseline & Actor Critic players
Search player
GUI

-> All experiments are based on this release. Will create new release if learning players are changed.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Releases: masus04/Deep-Reinforcement-Learning-for-Boardgames

Unified Experiments

Uh oh!

Major network updates

Uh oh!

Major TicTacToe fix

Uh oh!

Major Functionality release

Uh oh!