GitHub

I am just trying to learn reinforcement learning

THe initial state is at a random position from limx->0+ to limx->10-

My state includes the car's position from the left wall and the angle of movement THe action is to change the car's angle

THe linear model's output is the mean and log(std) from which I sample from

The end is after 100 sucessful steps

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
.gitignore		.gitignore
README.md		README.md
model.py		model.py
train.py		train.py

Provide feedback