hartikainen / easy21
Reinforcement learning agents and environment for Easy21, a modified version of Blackjack
☆14Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for easy21
- solutions to David Silver's RL course project Easy21☆19Updated 8 years ago
- Assignments for CS294-112 Fall2018 in Pytorch☆63Updated 6 years ago
- My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning☆90Updated 5 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆47Updated 6 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Solutions to the Deep RL Bootcamp labs☆44Updated 7 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆44Updated 6 years ago
- A simple Gridworld environment for Open AI gym☆24Updated 6 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow☆55Updated 2 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆29Updated 7 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆17Updated 7 years ago
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆19Updated 6 years ago
- Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).☆88Updated 5 years ago
- tensorflow implementation of Andrej Karpathy's blog about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/☆31Updated 4 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym☆40Updated 6 years ago
- Pong AI trained using policy gradient-based reinforcement learning☆51Updated last year
- C implementation of RL and IRL algorithms☆19Updated 4 years ago
- Pytorch Implementation of Proximal Policy Optimization Algorithm☆20Updated 6 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆65Updated 7 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…☆74Updated 7 years ago
- Solving easy21 assigment from RL class by David Silver; A practical guide to get started with RL for beginners.☆18Updated 5 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli…☆19Updated 6 years ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆101Updated 4 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆53Updated 4 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 4 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- ☆29Updated last year