hongzimao / a3c
Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆25Updated 8 years ago
Alternatives and similar repositories for a3c
Users who are interested in a3c are comparing it to the libraries listed below.
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 7 years ago
- Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co…☆130Updated 2 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 8 years ago
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- MAGNet: Multi-agents control using Graph Neural Networks☆132Updated 6 years ago
- Hierarchical Deep RL Network☆31Updated 8 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated last year
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆183Updated 7 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆152Updated 2 years ago
- An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le…☆132Updated 8 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆74Updated 8 years ago
- Reproductions of some RL or multi-agent models☆35Updated 6 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆95Updated 3 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆63Updated 7 years ago
- Code for hierarchical imitation learning and reinforcement learning☆300Updated 7 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning (GAIL) with discrete actions☆115Updated 7 years ago
- Notes and comments about Deep Reinforcement Learning papers☆77Updated 8 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…☆79Updated 8 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆106Updated 5 years ago
- Implementation of prioritized experience replay☆159Updated 7 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆127Updated 5 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 3 years ago
- Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinf…☆177Updated 4 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 7 years ago
- FEN Code☆40Updated 6 years ago