hongzimao / a3c
Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆25Updated 8 years ago
Alternatives and similar repositories for a3c
Users who are interested in a3c are comparing it to the repositories listed below
- FEN Code☆38Updated 5 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆128Updated 2 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 8 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆85Updated 6 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆61Updated 4 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆85Updated 7 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 6 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆38Updated 7 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆180Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 8 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆62Updated 6 years ago
- MAGNet: Multi-agents control using Graph Neural Networks☆132Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆131Updated last year
- Simple grid-world environment compatible with OpenAI-gym☆50Updated 5 years ago
- Hierarchical Deep RL Network☆31Updated 8 years ago
- ☆69Updated 6 years ago
- This is the PyTorch implementation of the ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆53Updated 2 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space☆46Updated 7 years ago
- Tensorflow implementation of BootstrappedDQN using OpenAI baselines☆19Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 7 months ago