MehdiAbbanaBennani / reinforcement-learning-on-blackjack
On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)
☆11Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for reinforcement-learning-on-blackjack
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆17Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- PyTorch implementation of various reinforcement learning algorithms☆19Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 4 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆17Updated 5 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆32Updated last year
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆33Updated 3 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆29Updated 2 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆32Updated 8 years ago
- some common TD Learning algorithms☆67Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- Ranking Policy Gradient☆23Updated 4 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆33Updated 7 years ago
- ☆42Updated 5 years ago
- PyTorch code to train and evaluate Procgen tasks☆23Updated 4 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- IRL implementation based on Norvig's AIMA code.☆14Updated 10 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- ☆14Updated 8 years ago