yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆82Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Attention-DQN
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated last year
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆114Updated 2 weeks ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆54Updated last year
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆131Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆70Updated 7 years ago
- ☆83Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- ☆77Updated 6 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆81Updated 7 years ago
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)☆108Updated last year
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 4 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆178Updated 7 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆112Updated 6 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Code for hierarchical imitation learning and reinforcement learning☆286Updated 6 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆47Updated 6 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning☆95Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 4 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆36Updated 5 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.☆199Updated 5 years ago
- ☆119Updated last year
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆65Updated 7 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- research and implementations of Deep RL agents and their applications☆47Updated 3 weeks ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago