Reinforcement Learning

InfoBot: Transfer and exploration via the information bottleneck (Apr 2019)
Using information bottleneck (variational encoder) in agent to improve generalization and exploration of states.

Exploration by Random Network Distillation [code] (Oct 2018)
RND incentivizes visiting unfamiliar states by measuring how hard it is to predict the output of a fixed random neural network on visited states.

Self-Imitation Learning (Jun 2018)
Self-imitation learning improves actor-critic (A2C) agents to explore deeper.

Imagination-Augmented Agents for Deep Reinforcement Learning (Feb 2018)
Imagination-Augmented Agents (I2As) learns to interpret predictions from a learned environment model to construct implicit plans in arbitrary ways, by using the predictions as additional context in deep policy networks.

Proximal Policy Optimization Algorithms [code] (Jul 2017)

How to add a paper / dataset:

Check in which category the paper fits
Check in which subcategory the paper fits (create a new one if needed)
Add the title, link, the month and year it was published, a link to the code if exits and the contribution of the paper. Papers should be sorted by more recent first in each category. Example:

Examples:

Title of the paper [code] (Jun 2018)
A couple of lines describing the main contribution of the paper. Do not copy the abstract or write more than 2 lines in order to keep the wiki tidy.

Title of the paper (Jan 2018)
A couple of lines describing the main contribution of the paper. Do not copy the abstract or write more than 2 lines in order to keep the wiki tidy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reinforcement Learning

How to add a paper / dataset:

Examples:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Home

Categories:

Datasets

Clone this wiki locally