ppyht2 / tf-a2c
Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games
☆13 Updated 6 years ago
Related projects
Alternatives and complementary repositories for tf-a2c
- A collection of code investigating the use of information theory for abstractions in RL ☆16 Updated 6 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning ☆31 Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆50 Updated 5 years ago
- Distributed implementation of popular evolutionary methods ☆64 Updated 6 years ago
- ☆32 Updated 6 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning' ☆22 Updated 7 years ago
- Inferring beliefs about dynamics from behavior ☆28 Updated 6 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning' ☆13 Updated 7 years ago
- A simple Gridworld environment for Open AI gym ☆24 Updated 6 years ago
- Deep Q Network implemented in TensorFlow ☆25 Updated 6 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env ☆71 Updated 7 years ago
- A2C for GVG-AI ☆21 Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization ☆50 Updated 6 years ago
- Code accompanying the OptionGAN paper. ☆43 Updated 6 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆30 Updated 5 years ago
- Distributed DRL by Ray and TensorFlow Tutorial. ☆9 Updated 4 years ago
- Reinforcement learning benchmarking. ☆39 Updated 6 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆32 Updated 7 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning. ☆66 Updated 6 years ago
- This is my implementation of the Optimality Tightening ☆37 Updated 7 years ago
- Imagination Augmented Agents TensorFlow ☆26 Updated 4 years ago
- ☆35 Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard ☆103 Updated 5 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras ☆32 Updated 8 years ago
- Self-written implementation of DPPO (Distributed Proximal Policy Optimization) using TensorFlow ☆12 Updated 7 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks. ☆69 Updated 7 years ago
- An implementation of the Deep Deterministic Policy Gradient (DDPG) algorithm using Keras/Tensorflow with the robot simulated using ROS/Ga… ☆61 Updated 7 years ago