ppyht2 / tf-a2c
Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games
☆12 Updated 6 years ago
Alternatives and similar repositories for tf-a2c:
Users who are interested in tf-a2c are comparing it to the repositories listed below
- Modified TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning' ☆21 Updated 8 years ago
- Distributed TensorFlow implementation of Asynchronous Methods for Deep Reinforcement Learning ☆31 Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks. ☆70 Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 Updated 6 years ago
- Reimplementation of the DDPG algorithm using TensorFlow ☆38 Updated 8 years ago
- A2C for GVG-AI ☆21 Updated 6 years ago
- TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning' ☆13 Updated 8 years ago
- ☆32 Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization ☆50 Updated 7 years ago
- A simple Gridworld environment for OpenAI Gym ☆24 Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 Updated 6 years ago
- A first bare-bones parallelized implementation of Go-Explore as described in the Uber Engineering blog post ☆45 Updated 5 years ago
- Self-implementation of DPPO (Distributed Proximal Policy Optimization) using TensorFlow ☆12 Updated 7 years ago
- Code accompanying the OptionGAN paper. ☆44 Updated 6 years ago
- Collaborative Deep Reinforcement Learning ☆32 Updated 7 years ago
- Unofficial implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆45 Updated 4 years ago
- ☆1 Updated 2 years ago
- ☆35 Updated 6 years ago
- Implementation of modular composition network from https://arxiv.org/pdf/1711.11289.pdf ☆25 Updated 7 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN) ☆56 Updated 7 years ago
- A simple example of imitation learning with Dataset Aggregation (DAGGER) on Torcs Env ☆71 Updated 7 years ago
- ☆25 Updated 7 years ago
- Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces ☆43 Updated 6 years ago
- Imagination Augmented Agents TensorFlow ☆26 Updated 4 years ago
- Meta Reinforcement Learning Experiments ☆33 Updated 7 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 7 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆31 Updated 5 years ago
- Model-Free Episodic Control ☆15 Updated 8 years ago