hartikainen / easy21Links

Reinforcement learning agents and environment for Easy21, a modified version of Blackjack

☆14

Alternatives and similar repositories for easy21

Users that are interested in easy21 are comparing it to the libraries listed below

Sorting:

kvfrans / Easy21-RL
solutions to David Silver's RL course project Easy21
☆19Updated 8 years ago
Officium / RL-Experiments
High-quality implementations of deep reinforcement learning algorithms for experiments
☆51Updated 9 months ago
kimhc6028 / policy-gradient-importance-sampling
Policy gradient reinforcement learning algorithm with importance sampling
☆32Updated 7 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132Updated 6 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆28Updated 6 years ago
Finspire13 / pytorch-policy-gradient-example
A toy example of Policy Gradient implemented in Pytorch
☆93Updated 7 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50Updated 5 years ago
flyyufelix / VizDoom-Keras-RL
Reinforcement Learning in Keras on VizDoom
☆143Updated 7 years ago
krfricke / rl-benchmark
Reinforcement learning benchmarking.
☆40Updated 6 years ago
rgilman33 / baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆53Updated 5 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆52Updated 7 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆95Updated 2 years ago
louaaron / GAN-Q-Learning
Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47Updated 4 years ago
angusfung / population-based-training
Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.
☆56Updated 6 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
Kaixhin / spinning-up-basic
Basic versions of agents from Spinning Up in Deep RL written in PyTorch
☆203Updated 4 years ago
dai-dao / PPO-Pytorch
Implementation of PPO in Pytorch
☆41Updated 7 years ago
flowersteam / geppg
☆35Updated 6 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19Updated 6 years ago
TreB1eN / HiddenMarkovModel_Pytorch
Pytorch: Viterbi, Forward-Backward and Baum Welch with a Hidden Markov Model (HMM)
☆56Updated 6 years ago
Alexander-H-Liu / Policy-Gradient-and-Actor-Critic-Keras
Simple implementation of Policy Gradient (PG)/ Actor-Critic with keras
☆29Updated 7 years ago
monoelh / deep-reinforcement-learning_DDQN_PPO_HER
MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli…
☆20Updated 7 years ago
dchetelat / acer
PyTorch implementation of both discrete and continuous ACER
☆24Updated 6 years ago
takoika / PrioritizedExperienceReplay
Yet another prioritized experience replay buffer implementation.
☆48Updated 2 years ago
andreimuntean / A3C
Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.
☆66Updated 7 years ago
spiglerg / DQN_DDQN_Dueling_and_DDPG_Tensorflow
Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…
☆77Updated 8 years ago
RobertTLange / spinningup-workspace
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
☆38Updated 2 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆95Updated 4 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49Updated 7 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102Updated 6 years ago