ppyht2 / tf-a2c
Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games
☆12Updated 7 years ago
Alternatives and similar repositories for tf-a2c:
Users that are interested in tf-a2c are comparing it to the libraries listed below
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Implementation of modular composition network from https://arxiv.org/pdf/1711.11289.pdf☆25Updated 7 years ago
- Imagination Augmented Agents TensorFlow☆26Updated 5 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Updated 8 years ago
- ☆56Updated 2 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- ☆29Updated 6 years ago
- Codebase for Efficient yet simple Reinforcement Learning Research Framework☆28Updated 2 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Deep Q Network implements by Tensorflow☆25Updated 7 years ago
- Reinforcement Learning for Super Mario Bros using A3C on GPU☆37Updated 6 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆9Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Reinforcement Learning and Deep Learning Resources☆16Updated 6 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- ☆17Updated 7 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 4 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- A simple Gridworld environment for Open AI gym☆25Updated 6 years ago