hartikainen / easy21
Reinforcement learning agents and environment for Easy21, a modified version of Blackjack
☆14 Updated 8 years ago
Alternatives and similar repositories for easy21
Users who are interested in easy21 are comparing it to the repositories listed below.
- solutions to David Silver's RL course project Easy21 ☆19 Updated 8 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 Updated 9 months ago
- Policy gradient reinforcement learning algorithm with importance sampling ☆32 Updated 7 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre… ☆132 Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆28 Updated 6 years ago
- A toy example of Policy Gradient implemented in Pytorch ☆93 Updated 7 years ago
- Simple grid-world environment compatible with OpenAI-gym ☆50 Updated 5 years ago
- Reinforcement Learning in Keras on VizDoom ☆143 Updated 7 years ago
- Reinforcement learning benchmarking. ☆40 Updated 6 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation ☆53 Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization ☆52 Updated 7 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆95 Updated 2 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆47 Updated 4 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆56 Updated 6 years ago
- Combining deep learning and reinforcement learning. ☆80 Updated 3 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch ☆203 Updated 4 years ago
- Implementation of PPO in Pytorch ☆41 Updated 7 years ago
- ☆35 Updated 6 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 Updated 6 years ago
- Pytorch: Viterbi, Forward-Backward and Baum Welch with a Hidden Markov Model (HMM) ☆56 Updated 6 years ago
- Simple implementation of Policy Gradient (PG)/ Actor-Critic with keras ☆29 Updated 7 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli… ☆20 Updated 7 years ago
- PyTorch implementation of both discrete and continuous ACER ☆24 Updated 6 years ago
- Yet another prioritized experience replay buffer implementation. ☆48 Updated 2 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model. ☆66 Updated 7 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient… ☆77 Updated 8 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial. ☆38 Updated 2 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆95 Updated 4 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 Updated 7 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard ☆102 Updated 6 years ago