sufengniu / GVIN
Generalized Value Iteration Network
☆23 Updated 5 years ago
Related projects:
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation ☆86 Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆48 Updated 5 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning. ☆64 Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆92 Updated 2 years ago
- Tensorflow implementation of BootstrappedDQN using OpenAI baselines ☆19 Updated 3 years ago
- Hierarchical Deep RL Network ☆29 Updated 7 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning ☆95 Updated 6 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published: https://arxiv.org/abs/1703.01161 ☆179 Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. ☆48 Updated 5 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments) ☆36 Updated 3 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018 ☆62 Updated 6 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari ☆82 Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard ☆101 Updated 5 years ago
- Source code for our NIPS 2017 paper, InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations ☆40 Updated 6 years ago
- Proximal Policy Optimization in PyTorch ☆39 Updated 6 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019) ☆65 Updated 4 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning ☆93 Updated 4 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ. ☆15 Updated 7 years ago
- NIPS 2017 Value Prediction Network ☆165 Updated 6 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr… ☆35 Updated 5 years ago
- FEN Code ☆36 Updated 4 years ago
- Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Co… ☆129 Updated last year
- Official implementation of ICML paper Imitating Latent Policies from Observation ☆73 Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆28 Updated 5 years ago