hartikainen / easy21
Reinforcement learning agents and environment for Easy21, a modified version of Blackjack
☆14 · Updated 8 years ago
Alternatives and similar repositories for easy21
Users who are interested in easy21 are comparing it to the repositories listed below.
- Surprise-based intrinsic motivation for deep reinforcement learning ☆21 · Updated 8 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch ☆173 · Updated 4 years ago
- Machine Learning Course Project Skoltech 2018 ☆108 · Updated 6 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C) ☆38 · Updated 4 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆97 · Updated 5 years ago
- A toy example of Policy Gradient implemented in PyTorch ☆95 · Updated 7 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆29 · Updated 6 years ago
- Implementation of PPO in PyTorch ☆41 · Updated 8 years ago
- Atari - Deep Reinforcement Learning algorithms in TensorFlow ☆139 · Updated last year
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre… ☆133 · Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization ☆53 · Updated 8 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 · Updated last year
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation ☆53 · Updated 5 years ago
- Actor-critic trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch. ☆100 · Updated 5 years ago
- Proximal Policy Optimization with Stein Control Variates ☆33 · Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 · Updated 7 years ago
- Policy gradient reinforcement learning algorithm with importance sampling ☆32 · Updated 8 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Updated 6 years ago
- Modular PyTorch implementation of policy gradient methods ☆25 · Updated 7 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)" ☆17 · Updated 6 years ago
- C51-DDQN in Keras ☆126 · Updated 8 years ago
- PyTorch implementation of both discrete and continuous ACER ☆24 · Updated 6 years ago
- Simple grid-world environment compatible with OpenAI-gym ☆50 · Updated 5 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆103 · Updated 5 years ago
- Reinforcement Learning in Keras on VizDoom ☆142 · Updated 8 years ago
- Proximal Policy Optimization in PyTorch ☆39 · Updated 8 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th… ☆202 · Updated 5 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C) ☆47 · Updated 8 years ago
- A TensorFlow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆32 · Updated 8 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 · Updated 7 years ago