hongzimao / a3c

TensorFlow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".

☆25

Alternatives and similar repositories for a3c

Users who are interested in a3c are comparing it to the repositories listed below.

PKU-RL / FEN
FEN Code
☆38 Updated 5 years ago
ASzot / ppo-pytorch
Proximal policy optimization in PyTorch. Easy to read and understand.
☆49 Updated 4 years ago
wwxFromTju / deepmind_MAS_enviroment
Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co…
☆128 Updated 2 years ago
louaaron / GAN-Q-Learning
Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47 Updated 4 years ago
liampetti / DDPG
Implementation of DDPG (modified from the work of Patrick Emami) - TensorFlow (no TFLearn dependency), Ornstein-Uhlenbeck noise function,…
☆64 Updated 8 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆85 Updated 6 years ago
shakti365 / soft-actor-critic
TF2 Implementation of the Soft Actor-Critic Algorithm
☆44 Updated 2 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x
☆61 Updated 4 years ago
ying-wen / malib_deprecated
A Multi-agent Learning Framework
☆62 Updated 4 years ago
dnddnjs / feudal-montezuma
PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆94 Updated 2 years ago
skumar9876 / Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…
☆85 Updated 7 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47 Updated 6 years ago
agakshat / maddpg
Implementation of Multi-Agent Deep Deterministic Policy Gradients
☆38 Updated 7 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆180 Updated 6 years ago
tdavchev / option-critic
A TensorFlow implementation of the Option-Critic Architecture
☆72 Updated 8 years ago
cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35 Updated 6 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆51 Updated 7 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66 Updated 5 years ago
msinto93 / D4PG
TensorFlow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125 Updated 5 years ago
TianhongDai / distributed-ppo
This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).
☆62 Updated 6 years ago
tegg89 / magnet
MAGNet: Multi-agents control using Graph Neural Networks
☆132 Updated 6 years ago
facebookresearch / CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆131 Updated last year
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50 Updated 5 years ago
tesslerc / H-DRLN
Hierarchical Deep RL Network
☆31 Updated 8 years ago
jcwleo / mario_rl
☆69 Updated 6 years ago
TianhongDai / self-imitation-learning-pytorch
This is a PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".
☆66 Updated 6 years ago
saizhang0218 / VBC
PyTorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"
☆53 Updated 2 years ago
stefanbo92 / A3C-Continuous
TensorFlow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action spaces
☆46 Updated 7 years ago
rrmenon10 / Bootstrapped-DQN
TensorFlow implementation of Bootstrapped DQN using OpenAI baselines
☆19 Updated 4 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119 Updated 7 months ago