mynkpl1998 / Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
☆82Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Recurrent-Deep-Q-Learning
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- ☆41Updated 5 years ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- Gym-like extensions for POMDP☆56Updated 3 years ago
- ☆71Updated 5 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆117Updated 3 months ago
- ☆118Updated 4 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆96Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆114Updated 3 weeks ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆82Updated 7 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆50Updated 3 years ago
- ☆119Updated last year
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆96Updated 5 years ago
- Gridworld for MARL experiments☆137Updated 3 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆32Updated 5 years ago
- Hierarchical-DQN in pytorch (not actively maintained)☆68Updated 7 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆157Updated 2 years ago
- ☆44Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆133Updated 5 years ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.☆199Updated 5 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated last year
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆149Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆96Updated 3 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆102Updated 2 years ago