MehdiAbbanaBennani / reinforcement-learning-on-blackjack
On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)
☆10Updated 7 years ago
Alternatives and similar repositories for reinforcement-learning-on-blackjack:
Users that are interested in reinforcement-learning-on-blackjack are comparing it to the libraries listed below
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆75Updated 5 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆34Updated 3 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago
- PyTorch implementation of various reinforcement learning algorithms☆18Updated 6 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- Deep Q Network implements by Tensorflow☆25Updated 6 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆92Updated 6 years ago
- ICRL 2020☆19Updated 4 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- 课程笔记,David Silver,CS294 ...☆15Updated 6 years ago
- Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"☆24Updated 6 years ago
- ☆43Updated 5 years ago
- ZForcing Repo☆40Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- An optimized version of SeqGAN in pytorch☆12Updated 6 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago