MehdiAbbanaBennani / reinforcement-learning-on-blackjackLinks

On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)

☆10

Alternatives and similar repositories for reinforcement-learning-on-blackjack

Users that are interested in reinforcement-learning-on-blackjack are comparing it to the libraries listed below

Sorting:

veronicachelu / meta-learning
Meta Reinforcement Learning Experiments
☆34Updated 7 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19Updated 6 years ago
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆44Updated 6 years ago
Riashat / Bayesian-Exploration-Deep-RL
Bayesian Uncertainty Exploration in Deep Reinforcement Learning
☆18Updated 7 years ago
kimhc6028 / policy-gradient-importance-sampling
Policy gradient reinforcement learning algorithm with importance sampling
☆32Updated 7 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
analog-rl / Duel_DDQN
Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras
☆31Updated 9 years ago
cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35Updated 6 years ago
AutumnWu / Streamlined-Off-Policy-Learning
ICRL 2020
☆19Updated 5 years ago
DongjunLee / dqn-tensorflow
Deep Q Network implements by Tensorflow
☆25Updated 7 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30Updated 4 years ago
anirudh9119 / rl_adversarial
Learning Backtracking Models, ICLR'19
☆10Updated 7 years ago
TianhongDai / self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
☆66Updated 6 years ago
slowbull / DDPG
Tensorflow implementation of Deep Deterministic Policy Gradients
☆19Updated 8 years ago
angusfung / population-based-training
Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.
☆56Updated 6 years ago
0b01 / CommNet
PyTorch implementation of CommNet
☆36Updated 7 years ago
cmusjtuliuyuan / RainBow
RainBow, Tensorflow
☆49Updated 7 years ago
facebookresearch / modeling_long_term_future
Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future
☆50Updated 6 years ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆69Updated 7 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆95Updated 4 years ago
TomZahavy / CB_AE_DQN
Contextual Bandits Action Elimination DQN
☆21Updated 7 years ago
alok / rl_implementations
☆43Updated 6 years ago
DorianKodelja / DeepMind-Atari-Deep-Q-Learner-2Player
☆13Updated 9 years ago
gopala-kr / DRL-Agents
research and implementations of Deep RL agents and their applications
☆51Updated 3 weeks ago
jsikyoon / a3c-distributed_tensorflow
Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning
☆29Updated 7 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35Updated 4 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23Updated 5 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
RonanFR / UCRL
☆27Updated 6 years ago
ZhengyaoJiang / NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
☆76Updated 5 years ago