Multiagent_chainMDP

Environment needed:

python3.5, Tensorflow1.4.0, Numpy1.13.3, matplotlib2.1.0, Tensorboard0.4.0rc(unnecessary)

Both GPU or CPU for tensorflow are surpported.
References:
Need to be done:
- Multi thread to speed up
- More algorithm should be applied
- A more efficient exploration method to solve it.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
env		env
for_comparing		for_comparing
model		model
utils		utils
README.md		README.md
config.py		config.py
play.py		play.py
play_re.py		play_re.py

Provide feedback