Useful resources relating to Reinforcement Learning
- Pathmind AI Wiki: A Beginner's Guide to Deep Reinforcement Learning
- OpenAI PPO Baselines: Provides a short explanation of PPO, with baselines for PPO, PPO2, ACER, and TRPO
- Karpathy RL: Andrej Karpathy's thoughts on RL
- Lil'Log: Lilian Weng's blog, all things RL
- Hugging Face: Illustrating Reinforcement Learning from Human Feedback (RLHF)
- Hugging Face: Introducing Decision Transformers on Hugging Face 🤗
- DeepMind: Optimising computer systems with more generalised AI tools
- Hugging Face Deep-RL: Course on Deep-RL
- Stanford CS234: Stanford course on RL
- Berkeley Deep-RL: Berkeley course on RL
- Kaggle: Quick courses on a variety of data science and ML topics
- OpenAI Spinning Up: Educational resource produced by OpenAI that makes it easier to learn about Deep-RL
- UCLxDavid-Silver RL: David Silver's 2015 RL course at UCL
- DeepMind YT Playlist: A series of lectures by DeepMind
- (ICML'15): Trust Region Policy Optimization (TRPO) [PDF]
- Solving Large Imperfect Information Games Using CFR+ [PDF]
- MCCFR: Monte Carlo Sampling for Regret Minimization in Extensive Games [PDF]
- Pluribus: Superhuman AI for multiplayer poker [PDF, Meta AI Blog]
- ReBeL: Combining Deep Reinforcement Learning and Search for Imperfect-Information Games [arXiv, GitHub]
- AlphaStar: Mastering the real-time strategy game StarCraft II [DeepMind Blog]
- Negotiation and honesty in artificial intelligence methods for the board game of Diplomacy [DeepMind Blog, Nature]
- Mastering Stratego, the classic game of imperfect information [DeepMind Blog, DeepNash Paper, Science Article, DeepNash Code]
- Lux AI: Multi-agent RL, unit navigation, macro and micro strategies, multi-variable optimization, and resource gathering and allocation problems
- MSAIL: Michigan Student Artificial Intelligence Lab