wumo / Reinforcement-Learning-An-Introduction
Kotlin implementation of algorithms, examples, and exercises from the Sutton and Barto: Reinforcement Learning (2nd Edition)
☆39Updated 3 years ago
Alternatives and similar repositories for Reinforcement-Learning-An-Introduction:
Users that are interested in Reinforcement-Learning-An-Introduction are comparing it to the libraries listed below
- Awesome RL: Papers, Books, Codes, Benchmarks☆115Updated last year
- research and implementations of Deep RL agents and their applications☆49Updated 3 weeks ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- ☆26Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- Bandits Environments for the OpenAI Gym☆90Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated last month
- Pytorch implementation of Soft Actor-Critic☆18Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- PyTorch implementation of CommNet☆36Updated 7 years ago
- ☆18Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- ☆73Updated 8 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆135Updated 6 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆49Updated 2 years ago
- FEN Code☆37Updated 5 years ago
- MAGNet: Multi-agents control using Graph Neural Networks☆130Updated 5 years ago
- ☆73Updated 2 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆50Updated 6 months ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- ☆76Updated 7 years ago
- Basic reinforcement learning implementation with tensorflow version 2.0☆52Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆44Updated 2 years ago
- ☆92Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆31Updated 5 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space☆46Updated 7 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆39Updated 5 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆73Updated 4 years ago