kvfrans / Easy21-RL
solutions to David Silver's RL course project Easy21
☆19Updated 8 years ago
Alternatives and similar repositories for Easy21-RL:
Users who are interested in Easy21-RL are comparing it to the libraries listed below
- Direct Future Prediction (DFP) in Keras☆109Updated 7 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 5 years ago
- Reinforcement learning agents and environment for Easy21, a modified version of Blackjack☆14Updated 7 years ago
- Bandits Environments for the OpenAI Gym☆90Updated 5 years ago
- Highly Modular and Scalable Reinforcement Learning☆114Updated 5 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Updated 4 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 3 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- ☆68Updated 6 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- C51-DDQN in Keras☆126Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- Modular PyTorch implementation of policy gradient methods☆25Updated 6 years ago
- ☆46Updated 6 years ago
- Solving easy21 assignment from RL class by David Silver; A practical guide to get started with RL for beginners.☆18Updated 5 years ago
- some common TD Learning algorithms☆67Updated 5 years ago
- Deep RL Bootcamp solutions☆35Updated 7 years ago
- Machine Learning Course Project Skoltech 2018☆108Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- A PyTorch implementation of Human-Level Control through Deep Reinforcement Learning☆23Updated 7 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆132Updated 5 years ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆102Updated 5 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆52Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆33Updated 8 years ago
- Deep Reinforcement Learning with Fine Grained Action Repetition☆23Updated 7 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆42Updated 6 years ago