MehdiAbbanaBennani / reinforcement-learning-on-blackjack
On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)
☆11Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for reinforcement-learning-on-blackjack
- State Space Models for Reinforcement Learning in Tensorflow☆18Updated 5 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- A simple Gridworld environment for Open AI gym☆24Updated 6 years ago
- Reinforcement Learning and Deep Learning Resources☆16Updated 6 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients☆20Updated 7 years ago
- PyTorch implementation of various reinforcement learning algorithms☆19Updated 6 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆74Updated 4 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 3 years ago
- ICRL 2020☆18Updated 4 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 4 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆30Updated 5 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Deep Q Network implements by Tensorflow☆25Updated 6 years ago
- 课程笔记,David Silver,CS294 ...☆15Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆33Updated 8 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Code implementation of: "Graying the black box: Understanding DQNs"☆20Updated 7 years ago
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar…☆24Updated 6 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆131Updated 5 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago