Mostafa-Samir / 2048-RL-DRQN
An attempt at applying Deep RL on the board game 2048
☆16Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for 2048-RL-DRQN
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆31Updated 6 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆151Updated last year
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 6 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- ☆56Updated 6 years ago
- Tensorflow implementation of A3C algorithm☆48Updated 7 years ago
- Combining deep learning and reinforcement learning.☆81Updated 3 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆79Updated 7 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Updated 7 years ago
- Yet another prioritized experience replay buffer implementation.☆48Updated 2 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- A PyTorch implementation of Rainbow DQN agent☆165Updated 6 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆101Updated 5 years ago
- Keras implementation of DQN on ViZDoom environment☆53Updated 8 years ago
- ☆39Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- ☆53Updated 7 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- DDPG on OpenAI Gym Pendulum☆19Updated 8 years ago
- ☆57Updated last year
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Updated 7 years ago
- Direct Future Prediction (DFP ) in Keras☆109Updated 6 years ago
- ☆46Updated 6 years ago